RESULT 1
AAY99418 

ID AAY99418 standard; protein; 761 AA. 
XX 

AC AAY99418; 
XX 

DT 08-AUG-2000 (first entry) 
XX 

DE Human PRO1317 (UNQ783) amino acid sequence SEQ ID NO: 277.

XX 

KW Human; PRO polypeptide; membrane bound protein; receptor; diagnosis; 

KW transmembrane; secretion; immunoadhesion; pharmaceutical; screening. 
XX 

OS Homo sapiens. 
XX 

PN WO200012708-A2. 
XX

PA (GETH ) GENENTECH INC. 
XX 

PI Baker K, Goddard A, Gurney AL, Smith V, Watanabe CK, Wood WI; 
XX 

DR WPI; 2000-237871/20. 

DR N-PSDB; AAA37100. 
XX 

PT New mammalian DNA sequences encoding transmembrane, receptor or secreted 

PT PRO polypeptides, useful for screening of potential peptide or small 

PT molecule inhibitors of the relevant receptor/ligand interactions. 
XX 

PS Claim 12; Fig 158; 773pp; English. 
XX 

CC AAA37022 to AAA37144 encode the new isolated human transmembrane,

CC receptor or secreted PRO polypeptides given in AAY99340 to AAY99462. The 

CC transmembrane and receptor PRO proteins can be used for screening of 

CC potential peptide or small molecule inhibitors of the relevant 

CC receptor/ligand interactions. The polypeptides and nucleotide sequences 

CC encoding them have various industrial applications, including uses as

CC pharmaceutical and diagnostic agents. AAA37145 to AAA37330 represent PCR 

CC primers and hybridisation probes used in the isolation of the PRO 

CC polypeptides from the present invention 

XX 

SQ Sequence 761 AA; 



Query Match 100.0%; Score 4031; DB 3; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 2 
AAB24084 

ID AAB24084 standard; protein; 761 AA. 
XX 

AC AAB24084; 
XX 

DT 29-JAN-2001 (first entry) 
XX 

DE Human PRO1317 protein sequence SEQ ID NO: 71.

XX 

KW Human; tumour; diagnosis; neoplastic disease; neoplastic cell growth; 

KW proliferation; tumourigenesis; identification; cancer; cytostatic;

KW nootropic; neuroprotective; antiinflammatory; immunosuppressive; 

KW immunostimulant; antiangiogenic; leukaemia; lymphoid malignancy; 

KW neuronal disorder; glial disorder; astrocytal disorder; angiogenic; 

KW hypothalamic disorder; glandular disorder; macrophagal disorder; 

KW epithelial disorder; stromal disorder; blastocoelic disorder; 

KW inflammatory disorder; immunologic disorder. 



XX 

OS Homo sapiens. 
XX 

PN WO200053755-A2. 
XX 

PD 14-SEP-2000. 
XX 

PF 06-JAN-2000; 2000WO-US000376.
XX

PR 08-MAR-1999; 99WO-US005028.

PR 02-JUN-1999; 99WO-US012252.

PR 23-JUN-1999; 99US-0141037P.

PR 07-JUL-1999; 99US-0143048P.

PR 26-JUL-1999; 99US-0145698P.

PR 30-NOV-1999; 99WO-US028313.

PR 20-DEC-1999; 99WO-US030911.

PR 05-JAN-2000; 2000WO-US000219.
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Ashkenazi AJ, Baker KP, Goddard A, Gurney AL, Hillan KJ, Roy MA;

PI Watanabe CK, Wood WI; 

XX 

DR WPI; 2000-572270/53. 

DR N-PSDB; AAC58392. 
XX 

PT Thirty PRO polynucleotides encoding PRO polypeptides, useful in the

PT treatment, diagnosis and prevention of cancer. 

XX 

PS Claim 61; Fig 52; 286pp; English. 
XX 

CC The present invention describes an isolated antibody that binds to one of 

CC the human PRO proteins designated PRO212, PRO290, PRO341, PRO535, PRO619,

CC PRO717, PRO809, PRO830, PRO848, PRO943, PRO1005, PRO1009, PRO1025,

CC PRO1030, PRO1097, PRO1107, PRO1111, PRO1153, PRO1182, PRO1184, PRO1187,

CC PRO1281, PRO23, PRO39, PRO834, PRO1317, PRO1710, PRO2094, PRO2145 or

CC PRO2198. PRO antagonists can be used to inhibit tumour cell growth. The

CC PRO polypeptides and nucleotides are useful in the treatment, diagnosis 

CC and prevention of cancer. The antibodies and other anti-tumour compounds 

CC may be used to treat various conditions, including those characterised by

CC overexpression and/or activation of the amplified PRO genes. Exemplary 

CC conditions or disorders to be treated with such antibodies and other 

CC compounds include benign or malignant tumours (e.g., renal, liver, 

CC kidney, bladder, breast, gastric, ovarian, colorectal, prostate, 

CC pancreatic, lung, vulva, thyroid, hepatic carcinomas, sarcomas, 

CC glioblastomas, and various head and neck tumours), leukaemias and 

CC lymphoid malignancies, other disorders such as neuronal, glial, 

CC astrocytal, hypothalamic and other glandular, macrophagal, epithelial, 

CC stromal and blastocoelic disorders, and inflammatory, angiogenic and 

CC immunologic disorders. AAC58242 to AAC58366 represent PCR primers and 

CC hybridisation probes used in the isolation of the human PRO sequences. 

CC AAC58367 to AAC58396 and AAB24057 to AAB24089 represent human PRO 

CC polynucleotide and protein sequences given in the exemplification of the 

CC present invention 

XX 

SQ Sequence 761 AA;

Query Match 100.0%; Score 4031; DB 3; Length 761;

Best Local Similarity 100.0%; Pred. No. 0;

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761





RESULT 3 
AAB66167 

ID AAB66167 standard; protein; 761 AA. 
XX 

AC AAB66167; 
XX 

DT 02-APR-2001 (first entry) 
XX 

DE Protein of the invention #79. 
XX 

KW Secreted; transmembrane; gene therapy. 
XX 

OS Unidentified. 
XX 

PN WO200078961-A1. 
XX 

PD 28-DEC-2000. 
XX 

PF 18-FEB-2000; 2000WO-US004342.
XX 

PR 23-JUN-1999; 99US-0141037P.
PR 20-JUL-1999; 99US-0144758P.
PR 26-JUL-1999; 99US-0145698P.
PR 01-SEP-1999; 99WO-US020111.
PR 29-OCT-1999; 99US-0162506P.
PR 30-NOV-1999; 99WO-US028313.
PR 02-DEC-1999; 99WO-US028551.
PR 16-DEC-1999; 99WO-US030095.
PR 05-JAN-2000; 2000WO-US000219.
PR 06-JAN-2000; 2000WO-US000376.
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Baker KP, Botstein D, Desnoyers L, Eaton DL, Ferrara N, Fong S; 

PI Gao W, Goddard A, Godowski PJ, Grimaldi CJ, Gurney AL, Hillan KJ; 

PI Pan J, Paoni NF, Roy MA, Smith V, Stewart TA, Tumas D, Watanabe CK; 

PI Williams PM, Wood WI; 
XX 

DR WPI; 2001-071395/08. 
XX 

PT Secreted and transmembrane proteins and nucleic acids designated PRO, 

PT useful as hybridization probes, in chromosome and gene mapping and gene 

PT therapy. 
XX 

PS Claim 1; Fig 158; 787pp; English. 
XX 

CC The present invention relates to secreted and transmembrane proteins. 

CC These proteins and the DNA encoding them may be used as hybridization 

CC probes, in chromosome and gene mapping and in the generation of anti-

CC sense RNA and DNA. They may also be used to generate either

CC transgenic animals or knockout animals which are in turn useful for 

CC development and screening of therapeutically useful reagents. The nucleic 

CC acids may also be used in gene therapy 

XX 

SQ Sequence 761 AA; 



Query Match 100.0%; Score 4031; DB 4; Length 761;



Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761





RESULT 4 



AAM80295 

ID AAM80295 standard; protein; 761 AA. 
XX 

AC AAM80295; 
XX 

DT 06-NOV-2001 (first entry) 
XX 

DE Human protein SEQ ID NO 3947. 
XX 

KW Human; cytokine; cell proliferation; cell differentiation; gene therapy; 

KW vaccine; peptide therapy; stem cell growth factor; haematopoiesis; 

KW tissue growth factor; immunomodulatory; cancer; leukaemia; 

KW nervous system disorder; arthritis; inflammation. 

XX 

OS Homo sapiens. 
XX 

PN WO200157190-A2. 
XX 

PD 09-AUG-2001. 
XX 

PF 05-FEB-2001; 2001WO-US004098.
XX

PR 03-FEB-2000; 2000US-00496914.

PR 27-APR-2000; 2000US-00560875.

PR 20-JUN-2000; 2000US-00598075.

PR 19-JUL-2000; 2000US-00620325.

PR 01-SEP-2000; 2000US-00654936.

PR 15-SEP-2000; 2000US-00663561.

PR 20-OCT-2000; 2000US-00693325.

PR 30-NOV-2000; 2000US-00728422.
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Drmanac RT, Asundi V, Zhou P, Xu C, Cao Y; 

PI Ma Y, Zhao QA, Wang D, Wang J, Zhang J, Ren F, Chen R, Wang ZW; 

PI Xue AJ, Yang Y, Wehrman T, Goodrich R;

XX 

DR WPI; 2001-476283/51. 

DR N-PSDB; AAK53428. 
XX 

PT Nucleic acids encoding polypeptides with cytokine-like activities, useful 

PT in diagnosis and gene therapy. 

XX 

PS Claim 20; Page 6209-6210; 6221pp; English. 
XX 

CC The invention relates to polynucleotides (AAK51456-AAK53435) and the

CC encoded polypeptides (AAM78323-AAM80302) that exhibit activity relating to

CC cytokine, cell proliferation or cell differentiation or which may induce 

CC production of other cytokines in other cell populations. The 

CC polynucleotides and polypeptides are useful in gene therapy, vaccines or 

CC peptide therapy. The polypeptides have various cytokine-like activities, 

CC e.g. stem cell growth factor activity, haematopoiesis regulating 

CC activity, tissue growth factor activity, immunomodulatory activity and 

CC activin/inhibin activity and may be useful in the diagnosis and/or 

CC treatment of cancer, leukaemia, nervous system disorders, arthritis and 

CC inflammation. Note: Records for SEQ ID NO 2110 (AAK52581), 2111 

CC (AAK52582) and 3666 (AAM80020) are omitted as the relevant pages from the 



CC sequence listing were missing at the time of publication 
XX 

SQ Sequence 761 AA; 

Query Match 100.0%; Score 4031; DB 4; Length 761;

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 5 
AAB66043 

ID AAB66043 standard; protein; 761 AA. 
XX 

AC AAB66043; 
XX 

DT 30-MAR-2001 (first entry) 
XX 

DE Human TANGO 265. 
XX 

KW TANGO protein; INTERCEPT protein; neurological disorder; 

KW central nervous system; focal brain disorder; bipolar affective disorder; 

KW global-diffuse cerebral disorder; cerebrovascular; Alzheimer's disease; 

KW senile dementia; Huntington's disease; amyotrophic lateral sclerosis; 

KW Parkinson's; Gilles de la Tourette's syndrome; hypertension; sleep; 

KW neuropsychiatric; psychoactive substance use; anxiety. 
XX 

OS Homo sapiens. 
XX 

PN WO200077239-A2. 
XX 

PD 21-DEC-2000. 
XX 

PF 24-MAY-2000; 2000WO-US014858.
XX

PR 14-JUN-1999; 99US-00333159.
XX 

PA (MILL-) MILLENNIUM PHARM INC. 
XX 

PI McCarthy SA, Fraser CC, Sharp JD, Barnes TM; 
XX 

DR WPI; 2001-032313/04. 

DR N-PSDB; AAF45125, AAF45126. 

XX 

PT TANGO and INTERCEPT nucleic acids, proteins, and antibodies, useful for

PT screening assays and diagnostic assays and for the treatment of 

PT neurological diseases such as Alzheimer's, Parkinson's and Huntington's 

PT disease. 

XX 

PS Claim 8; Fig 3; 359pp; English. 
XX 

CC The present invention relates to TANGO or INTERCEPT proteins and coding 

CC sequences (see AAF45121-F45136 and AAF45138-F45139 and AAB66031-B66057,

CC AAB66064-B66083 and AAB66085). The TANGO/INTERCEPT proteins and coding

CC sequences are useful for the treatment of neurological disorders such as 

CC central nervous system (CNS) disorders, CNS-related disorders, focal 

CC brain disorders, global-diffuse cerebral disorders and other neurological 

CC and cerebrovascular disorders. The CNS disorders include Alzheimer's 

CC disease, senile dementia, Huntington's disease, amyotrophic lateral 

CC sclerosis, Parkinson's, Gilles de la Tourette's syndrome, autonomic 

CC function disorders such as hypertension and sleep disorders, 

CC neuropsychiatric disorders, psychoactive substance use disorders, 

CC anxiety, and bipolar affective disorder 



XX 

SQ Sequence 761 AA; 



Query Match 100.0%; Score 4031; DB 4; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0;

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 6 
ABG72615 

ID ABG72615 standard; protein; 761 AA. 
XX 

AC ABG72615; 
XX 

DT 12-FEB-2003 (first entry) 
XX 

DE Human cytokine or cell proliferation/differentiation protein #5.
XX 

KW Human; cytokine; cell proliferation; cell differentiation; anaemia; 

KW wound; ulcer; thrombocytopaenia; osteoporosis; osteoarthritis; 

KW inflammation; Alzheimer's disease; Parkinson's disease; stroke;

KW Huntington's disease; amyotrophic lateral sclerosis; HIV; 

KW immune deficiency; human immunodeficiency virus infection; 

KW severe combined immunodeficiency; infection; autoimmune disorder; 

KW rheumatoid arthritis; Guillain-Barre syndrome; graft-versus-host disease;

KW cancer; thyroid cancer; lung cancer; small cell carcinoma; 

KW Kaposi's sarcoma; brain tumour; prostate cancer; ovarian cancer; 

KW leukaemia; inflammatory disorder; inflammatory bowel disease; 

KW Crohn's disease; food supplement. 

XX 

OS Homo sapiens.
XX 

PN US2002137044-A1. 
XX 

PD 26-SEP-2002. 
XX 

PF 30-JAN-2001; 2001US-00774434.
XX

PR 03-FEB-2000; 2000US-00496914.

PR 27-APR-2000; 2000US-00560875.
XX 

PA (TANG/) TANG Y T. 

PA (ZHOU/) ZHOU P. 

PA (GOOD/) GOODRICH R. 

PA (LIUC/) LIU C. 

PA (ASUN/) ASUNDI V. 

PA (WEHR/) WEHRMAN T. 

PA (YANG/) YANG Y. 

PA (DRMA/) DRMANAC R T. 

XX 

PI Tang YT, Zhou P, Goodrich R, Liu C, Asundi V, Wehrman T, Yang Y; 

PI Drmanac RT; 

XX 

DR WPI; 2003-110596/10. 

DR N-PSDB; ABX13628. 
XX 

PT New polynucleotides for diagnosing, treating or preventing e.g. anemia, 

PT wounds, ulcers, thrombocytopenia, osteoporosis, inflammations, 

PT Alzheimer's disease, stroke, autoimmune disorders or cancers. 
XX 

PS Claim 19; Page 57-59; 63pp; English. 
XX 



CC The invention relates to an isolated polynucleotide encoding a cytokine 

CC or cell proliferation/differentiation-related protein (or the mature 

CC protein or active domain), sequences greater than 90% identical to it or

CC sequences complementary to it. Also included are the encoded proteins, 

CC expression vectors, transformed host cells, antibodies, and identifying a 

CC compound that binds to the polypeptide. The polynucleotide, polypeptides 

CC encoded by it, or antibodies to the polypeptides are useful in 

CC therapeutic, diagnostic or research methods. They are particularly useful 

CC for diagnosing, treating or preventing e.g. anaemias, wounds, ulcers, 

CC thrombocytopaenia, osteoporosis, osteoarthritis, inflammations, 

CC Alzheimer's disease, Parkinson's disease, Huntington's disease, 

CC amyotrophic lateral sclerosis, stroke, immune deficiencies (e.g. human 

CC immunodeficiency virus (HIV), severe combined immunodeficiency or

CC infections), autoimmune disorders (e.g. rheumatoid arthritis, Guillain-

CC Barre syndrome or graft-versus-host disease), cancers (e.g. thyroid

CC cancer, lung cancers, small cell carcinoma, Kaposi's sarcoma, brain 

CC tumours, prostate cancer, ovarian cancer or leukaemias), or inflammatory 

CC disorders (e.g. inflammatory bowel disease or Crohn's disease). The 

CC polynucleotides and proteins are useful for screening peptides or small 

CC molecule inhibitors or agonists that are useful for treating these 

CC diseases. The polypeptide is also useful as molecular markers, or as a 

CC food supplement. The present sequence is a cytokine or cell 

CC proliferation/differentiation-related protein of the invention 

XX 

SQ Sequence 761 AA;

Query Match 100.0%; Score 4031; DB 6; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 7 
ABU08366 

ID ABU08366 standard; protein; 761 AA. 
XX 

AC ABU08366; 
XX 

DT 03-JUN-2003 (first entry) 
XX 

DE Amino acid sequence for human TANGO 265. 
XX 

KW Human; TANGO 265; INTERCEPT; cellular process; tissue typing; 

KW forensic biology; cellular protease activity; cell interaction; 

KW development; blood disorder; haematopoietic cell-related disorder; 

KW growth; cell proliferation; cell differentiation; gamma delta T-cell; 

KW immune system; metabolic disorder; homeostatic disorder; anaemia; 

KW developmental bone disorder; osteoporosis; bacterial infection; 

KW thrombocytopaenia; renal failure; liver disease; cystic fibrosis; 

KW obesity; cancer; bronchitis; asthma; emphysema; pulmonary oedema; 

KW respiratory distress syndrome; osteopathic; antibacterial; antianaemic; 

KW thrombolytic; nephrotropic; antiobesity; hepatotropic; cytostatic; 

KW antiinflammatory; antiasthmatic. 

XX 

OS Homo sapiens. 
XX 

PN US2002182675-A1. 
XX 

PD 05-DEC-2002. 
XX 

PF 25-OCT-2001; 2001US-00042431.
XX

PR 14-JUN-1999; 99US-00333159.

PR 24-MAY-2000; 2000US-00578063.
XX 

PA (MILL-) MILLENNIUM PHARM INC.
XX 

PI McCarthy SA, Fraser CC, Sharp JD, Barnes TM; 
XX 

DR WPI; 2003-328617/31. 

DR N-PSDB; ABX94103. 
XX 

PT New TANGO and INTERCEPT proteins, useful as modulating agents in 

PT regulating a variety of cellular processes, in chromosome mapping, in 

PT tissue typing, and in forensic biology. 

XX 

PS Disclosure; Fig 3A-3E; 232pp; English. 
XX 

CC The present invention relates to the isolation of novel TANGO or 

CC INTERCEPT proteins, and the polynucleotide sequences encoding them. The 

CC invention discloses sequences for human TANGO 202, TANGO 234, TANGO 265, 

CC TANGO 273, TANGO 286, TANGO 294 and INTERCEPT 296. The sequences for 

CC murine TANGO 202 and TANGO 273 are also provided. The TANGO polypeptides 

CC and the polynucleotide sequences encoding them are useful as modulating 

CC agents in regulating a variety of cellular processes. The polynucleotide 

CC sequences are useful as primers or hybridisation probes for the detection 

CC of nucleic acids encoding TANGO polypeptides, chromosome mapping, tissue 

CC typing, and in forensic biology. TANGO 202 can be used to diagnose, 

CC prevent or treat disorders relating to aberrant cellular protease 

CC activity, inappropriate interaction of cells with mediators, 

CC inappropriate development, and blood and haematopoietic cell-related 

CC disorders. TANGO 234 can be used to modulate growth, proliferation, 

CC survival, differentiation, and activity of gamma delta T-cells. TANGO 265

CC can be used to prevent, diagnose and treat disorders characterised by 

CC aberrant organisation or development of a tissue or organ, and for 

CC modulating differentiation of cells of the immune system. TANGO 273 is 

CC useful for diagnosing, treating or preventing e.g. metabolic, homeostatic 

CC and developmental bone disorders (e.g. osteoporosis), and bacterial 

CC infection. TANGO 286 is useful for treating or preventing e.g. anaemia, 

CC thrombocytopaenia, renal failure or liver disease. TANGO 294 is useful 

CC for treating or preventing e.g. cystic fibrosis or obesity. INTERCEPT 296 

CC is useful for diagnosing, treating or preventing e.g. cancers, 

CC bronchitis, cystic fibrosis, asthma, emphysema, pulmonary oedema, or 

CC adult and infant respiratory distress syndromes. The present sequence 

CC represents human TANGO 265 

XX 

SQ Sequence 761 AA; 

Query Match 100.0%; Score 4031; DB 6; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761





RESULT 8 
ABO32703 

ID ABO32703 standard; protein; 761 AA. 
XX 

AC ABO32703; 
XX 

DT 17-SEP-2003 (first entry) 
XX 

DE Secreted polypeptide-related protein #84. 
XX 

KW Human; TANGO; INTERCEPT; secreted polypeptide; immune disorder; 

KW hormonal disorder; proliferative disorder; cancer; thyroid disorder; 



KW diabetes; multiple sclerosis; lupus; neurological disorder; anaemia; 

KW Alzheimer's disease; Parkinson's disease; cardiovascular disorder;

KW myocardial infarction; congestive heart disease; blood platelet disorder; 

KW thrombocytopenia; blood vessel; atherosclerosis; vasculitis.

XX 

OS Homo sapiens. 
XX 

PN US2003022279-A1. 
XX 

PD 30-JAN-2003. 
XX 

PF 12-JAN-2001; 2001US-00759130.
XX

PR 14-JUN-1999; 99US-00333159.

PR 29-JUN-1999; 99US-00342364.

PR 10-SEP-1999; 99US-00393996.

PR 19-OCT-1999; 99US-00420707.

PR 07-JAN-2000; 2000US-00479249.

PR 27-APR-2000; 2000US-00559497.

PR 24-MAY-2000; 2000US-00578063.

PR 16-JUN-2000; 2000US-00596194.

PR 23-JUN-2000; 2000US-00602871.

PR 30-JUN-2000; 2000US-00608452.
XX 

PA (FRAS/) FRASER C C. 

PA (BARN/) BARNES T M. 

PA (SHAR/) SHARP J D. 

PA (KIRS/) KIRST S J. 

PA (MYER/) MYERS P S. 

PA (LEIB/) LEIBY K R. 

PA (HOLT/) HOLTZMAN D A. 

PA (MCCA/) MCCARTHY S A. 

PA (WRIG/) WRIGHTON N. 

PA (MACK/) MACKAY C R. 

PA (GOOD/) GOODEARL ADJ. 
XX 

PI Fraser CC, Barnes TM, Sharp JD, Kirst SJ, Myers PS, Leiby KR; 

PI Holtzman DA, McCarthy SA, Wrighton N, Mackay CR, Goodearl ADJ; 
XX 

DR WPI; 2003-456290/43. 

DR N-PSDB; ACD66780, ACD66781. 

XX 

PT New nucleic acid molecule encoding a secreted protein (e.g. TANGO 202,

PT TANGO 210 or INTERCEPT 217), useful for diagnosing, preventing or 

PT treating disorders such as cancer, diabetes or atherosclerosis, and in 

PT forensic biology. 

XX 

PS Claim 9; Fig 27A-27E; 482pp; English. 
XX 

CC The invention relates to secreted polypeptide-related proteins and 

CC nucleic acids (TANGO and INTERCEPT proteins and nucleic acids). The

CC nucleic acids, proteins and antibodies specific to the proteins are 

CC useful in screening assays, predictive medicine (e.g. diagnostic assays, 

CC prognostic assays, monitoring clinical trials and pharmacogenetics) and 

CC prophylactic and therapeutic methods. The sequences are used in 

CC diagnosing, preventing or treating proliferative disorders (e.g. 

CC cancers), hormonal disorders (e.g. diabetes or thyroid disorders), immune 



CC disorders (e.g. multiple sclerosis or lupus), neurological disorders 

CC (e.g. Alzheimer's disease or Parkinson's disease), cardiovascular

CC disorders (e.g. myocardial infarction or congestive heart disease), blood 

CC platelet disorders (e.g. thrombocytopenia or anaemia) and disorders 

CC involving blood vessels (e.g. atherosclerosis or vasculitis). The nucleic 

CC acids may also be used in chromosome mapping, tissue typing and forensic 

CC biology, and as surrogate markers. This sequence represents a secreted 

CC polypeptide-related protein of the invention. Note: The sequence data for 

CC this patent was obtained in electronic format directly from USPTO at 

CC seqdata.uspto.gov/sequence.html

XX 

SQ Sequence 761 AA; 

Query Match 100.0%; Score 4031; DB 6; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761

RESULT 9 
ABO33660 

ID ABO33660 standard; protein; 761 AA. 
XX 

AC ABO33660; 
XX 

DT 17-SEP-2003 (first entry) 
XX 

DE Novel human secreted and transmembrane protein PRO1317.
XX 

KW Human; secreted and transmembrane protein; PRO; angiogenesis;

KW endothelial cell proliferation; wound healing; immune response; 

KW T-lymphocytes proliferation; neonatal heart hypertrophy; tumour; 

KW cardiac insufficiency disorder; calcium flux; inflammation; 

KW vascular endothelial growth factor-stimulated proliferation; 

KW mammalian kidney mesangial cell proliferation; Berger disease; 

KW nephropathy; Schonlein-Henoch purpura; celiac disease; Crohn's disease;

KW dermatitis herpetiformis; diabetes; haemoglobin switch; insulinaemia; 

KW pancreatic beta-cell precursor cell differentiation; thalassemias; 

KW obesity; auditory hair cell regeneration; hearing loss; bone disorder; 

KW cartilage disorder; sports injury; arthritis. 

XX 

OS Homo sapiens. 
XX 

PN US2003073130-A1. 
XX 



PD 17-APR-2003.




XX 
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2001; 2001US- 
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QY 
Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 10 
ABO44513

ID ABO44513 standard; protein; 761 AA.
XX

AC ABO44513;
XX 

DT 01-OCT-2003 (first entry) 
XX 

DE Human secreted/transmembrane protein PRO1317.
XX 

KW Human; secreted protein; transmembrane protein; PRO; vulnerary; cardiant; 
KW antidiabetic; anorectic; antiarthritic; angiogenesis; cancer;



KW adrenal cortical capillary; endothelial cell growth; wound healing; 

KW stimulated T-lymphocyte proliferation; immune response suppression; 

KW neonatal heart hypertrophy; cardiac insufficiency disorder; 

KW vascular endothelial growth factor; inflammation; mononuclear cell; 

KW eosinophil; diabetes; obesity; or hyper-insulinaemia; hypo-insulinaemia; 

KW chondrocyte redifferentiation; bone disorder; cartilage disorder;

KW sports injury; arthritis. 

XX 

OS Homo sapiens. 
XX 

PN US2003044841-A1. 
XX 

PD 06-MAR-2003. 
XX 

PF 06-DEC-2001; 2001US-00006856.
XX 

PR 01-SEP-1998; 98US-0098716P.
PR 01-SEP-1998; 98US-0098723P.
PR 01-SEP-1998; 98US-0098749P.
PR 01-SEP-1998; 98US-0098750P.
PR 02-SEP-1998; 98US-0098803P.
PR 02-SEP-1998; 98US-0098821P.
PR 02-SEP-1998; 98US-0098843P.
PR 09-SEP-1998; 98US-0099536P.
PR 09-SEP-1998; 98US-0099596P.
PR 09-SEP-1998; 98US-0099598P.
PR 09-SEP-1998; 98US-0099602P.
PR 09-SEP-1998; 98US-0099642P.
PR 10-SEP-1998; 98US-0099741P.
PR 10-SEP-1998; 98US-0099754P.
PR 10-SEP-1998; 98US-0099763P.
PR 10-SEP-1998; 98US-0099792P.
PR 10-SEP-1998; 98US-0099808P.
PR 10-SEP-1998; 98US-0099812P.
PR 10-SEP-1998; 98US-0099815P.
PR 10-SEP-1998; 98US-0099816P.
PR 15-SEP-1998; 98US-0100385P.
PR 15-SEP-1998; 98US-0100388P.
PR 15-SEP-1998; 98US-0100390P.
PR 16-SEP-1998; 98US-0100584P.
PR 16-SEP-1998; 98US-0100627P.
PR 16-SEP-1998; 98US-0100661P.
PR 16-SEP-1998; 98US-0100662P.
PR 16-SEP-1998; 98US-0100664P.
PR 17-SEP-1998; 98US-0100683P.
PR 17-SEP-1998; 98US-0100684P.
PR 17-SEP-1998; 98US-0100710P.
PR 17-SEP-1998; 98US-0100711P.
PR 17-SEP-1998; 98US-0100919P.
PR 17-SEP-1998; 98US-0100930P.
PR 18-SEP-1998; 98US-0100848P.
PR 18-SEP-1998; 98US-0100849P.
PR 18-SEP-1998; 98US-0101014P.
PR 18-SEP-1998; 98US-0101068P.
PR 18-SEP-1998; 98US-0101071P.
PR 22-SEP-1998; 98US-0101279P.
PR 23-SEP-1998; 98US-0101471P.



PR 23-SEP-1998; 98US-0101472P.
PR 23-SEP-1998; 98US-0101474P.
PR 23-SEP-1998; 98US-0101475P.
PR 23-SEP-1998; 98US-0101476P.
PR 23-SEP-1998; 98US-0101477P.
PR 23-SEP-1998; 98US-0101479P.
PR 24-SEP-1998; 98US-0101738P.
PR 24-SEP-1998; 98US-0101741P.
PR 24-SEP-1998; 98US-0101743P.
PR 24-SEP-1998; 98US-0101915P.
PR 24-SEP-1998; 98US-0101916P.
PR 29-SEP-1998; 98US-0102207P.
PR 29-SEP-1998; 98US-0102240P.
PR 29-SEP-1998; 98US-0102307P.
PR 29-SEP-1998; 98US-0102330P.
PR 29-SEP-1998; 98US-0102331P.
PR 30-SEP-1998; 98US-0102484P.
PR 30-SEP-1998; 98US-0102487P.
PR 30-SEP-1998; 98US-0102570P.
PR 30-SEP-1998; 98US-0102571P.
PR 01-OCT-1998; 98US-0102684P.
PR 01-OCT-1998; 98US-0102687P.
PR 02-OCT-1998; 98US-0102965P.
PR 06-OCT-1998; 98US-0103258P.
PR 06-OCT-1998; 98US-0103449P.
PR 07-OCT-1998; 98US-0103314P.
PR 07-OCT-1998; 98US-0103315P.
PR 07-OCT-1998; 98US-0103328P.
PR 07-OCT-1998; 98US-0103395P.
PR 07-OCT-1998; 98US-0103396P.
PR 07-OCT-1998; 98US-0103401P.
PR 08-OCT-1998; 98US-0103633P.
PR 08-OCT-1998; 98US-0103678P.
PR 08-OCT-1998; 98US-0103679P.
PR 08-OCT-1998; 98US-0103711P.
PR 14-OCT-1998; 98US-0104257P.
PR 20-OCT-1998; 98US-0104987P.
PR 20-OCT-1998; 98US-0105000P.
PR 20-OCT-1998; 98US-0105002P.
PR 21-OCT-1998; 98US-0105104P.
PR 22-OCT-1998; 98US-0105169P.
PR 22-OCT-1998; 98US-0105266P.
PR 26-OCT-1998; 98US-0105693P.
PR 26-OCT-1998; 98US-0105694P.
PR 27-OCT-1998; 98US-0105807P.
PR 27-OCT-1998; 98US-0105881P.
PR 27-OCT-1998; 98US-0105882P.
PR 27-OCT-1998; 98US-0106062P.
PR 28-OCT-1998; 98US-0106023P.
PR 28-OCT-1998; 98US-0106029P.
PR 28-OCT-1998; 98US-0106030P.
PR 28-OCT-1998; 98US-0106032P.
PR 28-OCT-1998; 98US-0106033P.
PR 28-OCT-1998; 98US-0106178P.
PR 29-OCT-1998; 98US-0106248P.
PR 29-OCT-1998; 98US-0106384P.
PR 29-OCT-1998; 98US-0108500P.



PR 30-OCT-1998; 98US-0106464P.
PR 03-NOV-1998; 98US-0106856P.
PR 03-NOV-1998; 98US-0106902P.
PR 03-NOV-1998; 98US-0106905P.
PR 03-NOV-1998; 98US-0106919P.
PR 03-NOV-1998; 98US-0106932P.
PR 03-NOV-1998; 98US-0106934P.
PR 10-NOV-1998; 98US-0107783P.
PR 17-NOV-1998; 98US-0108775P.
PR 17-NOV-1998; 98US-0108779P.
PR 17-NOV-1998; 98US-0108787P.
PR 17-NOV-1998; 98US-0108788P.
PR 17-NOV-1998; 98US-0108801P.
PR 17-NOV-1998; 98US-0108802P.
PR 17-NOV-1998; 98US-0108806P.
PR 17-NOV-1998; 98US-0108807P.
PR 17-NOV-1998; 98US-0108867P.
PR 17-NOV-1998; 98US-0108925P.
PR 18-NOV-1998; 98US-0108848P.
PR 18-NOV-1998; 98US-0108849P.
PR 18-NOV-1998; 98US-0108850P.
PR 18-NOV-1998; 98US-0108851P.
PR 18-NOV-1998; 98US-0108852P.
PR 18-NOV-1998; 98US-0108858P.
PR 18-NOV-1998; 98US-0108904P.
PR 22-DEC-1998; 98US-0113296P.
PR 30-DEC-1998; 98US-0114223P.
PR 05-JAN-1999; 99WO-US000106.
PR 16-APR-1999; 99US-0129674P.
PR 23-JUN-1999; 99US-0141037P.
PR 20-JUL-1999; 99US-0144758P.
PR 26-JUL-1999; 99US-0145698P.
PR 01-SEP-1999; 99WO-US020111.
PR 15-SEP-1999; 99WO-US021194.
PR 29-OCT-1999; 99US-0162506P.
PR 30-NOV-1999; 99WO-US028313.
PR 02-DEC-1999; 99WO-US028551.
PR 16-DEC-1999; 99WO-US030095.
PR 05-JAN-2000; 2000WO-US000219.
PR 06-JAN-2000; 2000WO-US000376.
PR 11-FEB-2000; 2000WO-US003565.
PR 18-FEB-2000; 2000WO-US004342.
PR 24-FEB-2000; 2000WO-US005004.
PR 02-MAR-2000; 2000WO-US005841.
PR 15-MAR-2000; 2000WO-US006884.
PR 17-MAY-2000; 2000WO-US013705.
PR 22-MAY-2000; 2000WO-US014042.
PR 30-MAY-2000; 2000WO-US014941.
PR 02-JUN-2000; 2000WO-US015264.
PR 23-AUG-2000; 2000WO-US023522.
PR 24-AUG-2000; 2000WO-US023328.
PR 08-NOV-2000; 2000WO-US030952.
PR 10-NOV-2000; 2000WO-US030873.
PR 01-DEC-2000; 2000WO-US032678.
PR 28-FEB-2001; 2001WO-US006520.
PR 01-MAR-2001; 2001WO-US006666.
PR 01-JUN-2001; 2001WO-US017800.



PR 20-JUN-2001; 2001WO-US019692 . 

PR 29-JUN-2001; 2001WO-US021066 . 

PR 09-JUL-2001; 2001WO-US021735 . 

PR 04-SEP-2001; 2001US-00946374 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Baker KP, Botstein D, Desnoyers L, Eaton DL, Ferrara N, Fong S; 

PI Gao W, Goddard A, Godowski PJ, Grimaldi JC, Gurney AL, Hillan KJ; 

PI Pan J, Paoni NF, Roy MA, Smith V, Stewart TA, Tumas D, Watanabe CK; 

PI Williams PM, Wood WI; 
XX 

DR WPI; 2003-492259/46. 

DR N-PSDB; ACH04538. 
XX 

PT Novel secreted and transmembrane polypeptides and polynucleotides 

PT encoding them useful for treating various cardiac insufficiency 

PT disorders, bone and/or cartilage disorders such as sports injuries and 

PT arthritis. 



Query Match 100.0%; Score 4031; DB 7; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761





RESULT 11 
AB033537 

ID AB033537 standard; protein; 761 AA. 
XX 

AC AB033537; 
XX 

DT 17-SEP-2003 (first entry) 
XX 

DE Novel human secreted and transmembrane protein PRO1317.
XX 

KW Human; secreted and transmembrane protein; PRO; gene therapy; vaccine; 

KW tissue typing; chromosome identification; vaccine. 

XX 

OS Homo sapiens.
XX 

PN US2003073129-A1. 
XX 



PD 17-APR-2003.
XX
PF 04-SEP-2001; 2001US-00946374.
XX
PR 01-SEP-1998; 98US-0098716P.
PR 01-SEP-1998; 98US-0098723P.
PR 01-SEP-1998; 98US-0098749P.
PR 01-SEP-1998; 98US-0098750P.
PR 02-SEP-1998; 98US-0098803P.
PR 02-SEP-1998; 98US-0098821P.
PR 02-SEP-1998; 98US-0098843P.
PR 09-SEP-1998; 98US-0099536P.
PR 09-SEP-1998; 98US-0099596P.
PR 09-SEP-1998; 98US-0099598P.
PR 09-SEP-1998; 98US-0099602P.
PR 09-SEP-1998; 98US-0099642P.
PR 10-SEP-1998; 98US-0099741P.
PR 10-SEP-1998; 98US-0099754P.
PR 10-SEP-1998; 98US-0099763P.
PR 10-SEP-1998; 98US-0099792P.



PR 10-SEP-1998; 98US-0099808P.
PR 10-SEP-1998; 98US-0099812P.
PR 10-SEP-1998; 98US-0099815P.
PR 10-SEP-1998; 98US-0099816P.
PR 15-SEP-1998; 98US-0100385P.
PR 15-SEP-1998; 98US-0100388P.
PR 15-SEP-1998; 98US-0100390P.
PR 16-SEP-1998; 98US-0100584P.
PR 16-SEP-1998; 98US-0100627P.
PR 16-SEP-1998; 98US-0100661P.
PR 16-SEP-1998; 98US-0100662P.
PR 16-SEP-1998; 98US-0100664P.
PR 17-SEP-1998; 98US-0100683P.
PR 17-SEP-1998; 98US-0100684P.
PR 17-SEP-1998; 98US-0100710P.
PR 17-SEP-1998; 98US-0100711P.
PR 17-SEP-1998; 98US-0100919P.
PR 17-SEP-1998; 98US-0100930P.
PR 18-SEP-1998; 98US-0100848P.
PR 18-SEP-1998; 98US-0100849P.
PR 18-SEP-1998; 98US-0101014P.
PR 18-SEP-1998; 98US-0101068P.
PR 18-SEP-1998; 98US-0101071P.
PR 22-SEP-1998; 98US-0101279P.
PR 23-SEP-1998; 98US-0101471P.
PR 23-SEP-1998; 98US-0101472P.
PR 23-SEP-1998; 98US-0101474P.
PR 23-SEP-1998; 98US-0101475P.
PR 23-SEP-1998; 98US-0101476P.
PR 23-SEP-1998; 98US-0101477P.
PR 23-SEP-1998; 98US-0101479P.
PR 24-SEP-1998; 98US-0101738P.
PR 24-SEP-1998; 98US-0101741P.
PR 24-SEP-1998; 98US-0101743P.
PR 24-SEP-1998; 98US-0101915P.
PR 24-SEP-1998; 98US-0101916P.
PR 29-SEP-1998; 98US-0102207P.
PR 29-SEP-1998; 98US-0102240P.
PR 29-SEP-1998; 98US-0102307P.
PR 29-SEP-1998; 98US-0102330P.
PR 29-SEP-1998; 98US-0102331P.
PR 30-SEP-1998; 98US-0102484P.
PR 30-SEP-1998; 98US-0102487P.
PR 30-SEP-1998; 98US-0102570P.
PR 30-SEP-1998; 98US-0102571P.
PR 01-OCT-1998; 98US-0102684P.
PR 01-OCT-1998; 98US-0102687P.
PR 02-OCT-1998; 98US-0102965P.
PR 06-OCT-1998; 98US-0103258P.
PR 06-OCT-1998; 98US-0103449P.
PR 07-OCT-1998; 98US-0103314P.
PR 07-OCT-1998; 98US-0103315P.
PR 07-OCT-1998; 98US-0103328P.
PR 07-OCT-1998; 98US-0103395P.
PR 07-OCT-1998; 98US-0103396P.
PR 07-OCT-1998; 98US-0103401P.
PR 08-OCT-1998; 98US-0103633P.



PR 08-OCT-1998; 98US-0103678P.
PR 08-OCT-1998; 98US-0103679P.
PR 08-OCT-1998; 98US-0103711P.
PR 14-OCT-1998; 98US-0104257P.
PR 20-OCT-1998; 98US-0104987P.
PR 20-OCT-1998; 98US-0105000P.
PR 20-OCT-1998; 98US-0105002P.
PR 21-OCT-1998; 98US-0105104P.
PR 22-OCT-1998; 98US-0105169P.
PR 22-OCT-1998; 98US-0105266P.
PR 26-OCT-1998; 98US-0105693P.
PR 26-OCT-1998; 98US-0105694P.
PR 27-OCT-1998; 98US-0105807P.
PR 27-OCT-1998; 98US-0105881P.
PR 27-OCT-1998; 98US-0105882P.
PR 27-OCT-1998; 98US-0106062P.
PR 28-OCT-1998; 98US-0106023P.
PR 28-OCT-1998; 98US-0106029P.
PR 28-OCT-1998; 98US-0106030P.
PR 28-OCT-1998; 98US-0106032P.
PR 28-OCT-1998; 98US-0106033P.
PR 28-OCT-1998; 98US-0106178P.
PR 29-OCT-1998; 98US-0106248P.
PR 29-OCT-1998; 98US-0106384P.
PR 29-OCT-1998; 98US-0108500P.
PR 30-OCT-1998; 98US-0106464P.
PR 03-NOV-1998; 98US-0106856P.
PR 03-NOV-1998; 98US-0106902P.
PR 03-NOV-1998; 98US-0106905P.
PR 03-NOV-1998; 98US-0106919P.
PR 03-NOV-1998; 98US-0106932P.
PR 03-NOV-1998; 98US-0106934P.
PR 10-NOV-1998; 98US-0107783P.
PR 17-NOV-1998; 98US-0108775P.
PR 17-NOV-1998; 98US-0108779P.
PR 17-NOV-1998; 98US-0108787P.
PR 17-NOV-1998; 98US-0108788P.
PR 17-NOV-1998; 98US-0108801P.
PR 17-NOV-1998; 98US-0108802P.
PR 17-NOV-1998; 98US-0108806P.
PR 17-NOV-1998; 98US-0108807P.
PR 17-NOV-1998; 98US-0108867P.
PR 17-NOV-1998; 98US-0108925P.
PR 18-NOV-1998; 98US-0108848P.
PR 18-NOV-1998; 98US-0108849P.
PR 18-NOV-1998; 98US-0108850P.
PR 18-NOV-1998; 98US-0108851P.
PR 18-NOV-1998; 98US-0108852P.
PR 18-NOV-1998; 98US-0108858P.
PR 18-NOV-1998; 98US-0108904P.
PR 22-DEC-1998; 98US-00218517.
PR 22-DEC-1998; 98US-0113296P.
PR 30-DEC-1998; 98US-0114223P.
PR 05-JAN-1999; 99WO-US000106.
PR 12-APR-1999; 99US-00284291.
PR 16-APR-1999; 99US-0129674P.
PR 23-JUN-1999; 99US-0141037P.



PR 20-JUL-1999; 99US-0144758P.
PR 26-JUL-1999; 99US-0145698P.
PR 01-SEP-1999; 99WO-US020111.
PR 15-SEP-1999; 99WO-US021194.
PR 18-OCT-1999; 99US-00403297.
PR 30-NOV-1999; 99WO-US028313.
PR 02-DEC-1999; 99WO-US028551.
PR 16-DEC-1999; 99WO-US030095.
PR 05-JAN-2000; 2000WO-US000219.
PR 06-JAN-2000; 2000WO-US000376.
PR 11-FEB-2000; 2000WO-US003565.
PR 18-FEB-2000; 2000WO-US004342.
PR 24-FEB-2000; 2000WO-US005004.
PR 02-MAR-2000; 2000WO-US005841.
PR 15-MAR-2000; 2000WO-US006884.
PR 17-MAY-2000; 2000WO-US013705.
PR 22-MAY-2000; 2000WO-US014042.
PR 30-MAY-2000; 2000WO-US014941.
PR 02-JUN-2000; 2000WO-US015264.
PR 23-AUG-2000; 2000WO-US023522.
PR 24-AUG-2000; 2000WO-US023328.
PR 08-NOV-2000; 2000WO-US030952.
PR 10-NOV-2000; 2000WO-US030873.
PR 01-DEC-2000; 2000WO-US032678.
PR 28-FEB-2001; 2001WO-US006520.
PR 01-MAR-2001; 2001WO-US006666.
PR 01-JUN-2001; 2001US-00872035.
PR 01-JUN-2001; 2001WO-US017800.
PR 14-JUN-2001; 2001US-00882636.
PR 20-JUN-2001; 2001WO-US019692.
PR 29-JUN-2001; 2001WO-US021066.
PR 09-JUL-2001; 2001WO-US021735.
XX
PA (GETH ) GENENTECH INC.
XX
PI Baker KP, Botstein D, Desnoyers L, Eaton DL, Ferrara N, Fong S;
PI Gao W, Goddard A, Godowski PJ, Grimaldi JC, Gurney AL, Hillan KJ;
PI Pan J, Paoni NF, Roy MA, Smith V, Stewart TA, Tumas D, Watanabe CK;
PI Williams PM, Wood WI;
XX
DR WPI; 2003-585292/55.
DR N-PSDB; ACD68082.
XX
PT Novel isolated PRO polypeptides e.g. PRO1491 and PRO1571, useful in the
PT preparation of a medicament for treating a condition responsive to PRO
PT polypeptide, and as therapeutic agents e.g. vaccines.
XX
PS Claim 12; Fig 158; 561pp; English.
XX
CC The invention describes an isolated PRO (secreted and transmembrane)
CC polypeptide (I), having at least 80% sequence identity to a sequence


Query Match 100.0%; Score 4031; DB 7; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
PA (GETH ) GENENTECH INC.
PI Baker KP, Botstein D, Desnoyers L, Eaton DL, Ferrara N, Fong S;
PI Pan J, Paoni NF, Roy MA, Smith V, Stewart TA, Tumas D, Watanabe CK;

PI Williams PM, Wood WI;
DR WPI; 2003-555602/52.

DR N-PSDB; ADC18145.
PT Novel isolated PRO polypeptides e.g. PRO1491 and PRO1571, useful in the

PT preparation of a medicament for treating a condition responsive to PRO 

PT polypeptide, and as therapeutic agents e.g. vaccines. 
XX 

PS Claim 12; SEQ ID NO 277; 555pp; English. 
XX 

CC The invention relates to human PRO polypeptides and the polynucleotides 

CC encoding them. The sequences are useful in the preparation of a 

CC medicament for treating a condition responsive to a PRO polypeptide. The 

CC polypeptides are useful in a number of functional biological assays, as 

CC molecular weight markers for protein electrophoresis and as therapeutic 
Best Local Similarity 100.0%; Pred. No. 0;
Novel isolated PRO polypeptides e.g., PRO1130, PRO1275, PRO1418, PRO1555,
PRO1787 affect glucose or free fatty acid (FFA) uptake by skeletal muscle
cells and are useful for treating diabetes or hyper- or hypo-insulinemia.

Claim 12; SEQ ID NO 277; 553pp; English.
PT New secreted and transmembrane PRO polypeptides useful for treating

PT cancers, kidney disorders, Crohn's disease, diabetes mellitus, hyper- or 

PT hypo-insulinemia, sports injuries and arthritis. 

XX 

PS Claim 12; SEQ ID NO 277; 557pp; English. 
XX 

CC The invention relates to an isolated PRO polypeptide (secreted or 

CC transmembrane protein) having at least 80% amino acid sequence identity 

CC to an amino acid sequence chosen from 123 fully defined sequences as 

CC given in the specification (including their extracellular domains either

CC with or without their associated signal peptides). Also included are the

CC nucleotide (NA) sequences encoding PRO, a vector comprising the PRO NA, a 

CC host cell comprising the vector, producing PRO, a chimaeric molecule 

CC comprising PRO fused to a heterologous amino acid sequence, and an anti- 

CC PRO antibody. PRO is useful as molecular weight markers for protein

CC electrophoresis and also for chromosome identification. PRO is also 

CC useful for tissue typing. PRO and PRO NA are useful as hybridisation 

CC probes for a cDNA library to isolate the full-length PRO cDNA. PRO NA is 

CC useful for generating transgenic animals or knock-out animals which are 

CC useful in development and screening useful reagents. PRO NA is also 

CC useful in gene therapy. PRO1244, PRO1286 and PRO1303 polypeptides are

CC useful for treating cancerous tumours. PRO1250, PRO1418 and PRO1410

CC polypeptides are useful for suppressing immune response. PRO1246

CC PRO1410 and PRO1575 are useful in treating thalassaemias. The present

CC sequence represents a PRO protein of the invention. 



Search completed: May 5, 2004, 17:28:40 
Job time: 62 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: May 5, 2004, 17:27:32; Search time 23 Seconds
(without alignments)
1708.146 Million cell updates/sec



Title: US-10-015-391A-277
Perfect score: 4031
Sequence: 1 MALPALGLDPWSLLGLFLFQ..........CRTSASDVDADNNCLGTEVA 761



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
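
The run parameters above name a Smith-Waterman ("sw") model with separate gap-open and gap-extend penalties. The following is a minimal sketch of how such a local alignment score can be computed with the standard affine-gap (Gotoh) recurrences. It is not GenCore's implementation. The function names are hypothetical, the substitution function is a toy match/mismatch scorer rather than BLOSUM62, and the convention that the first gap position costs `gap_open` and each further position costs `gap_extend` is an assumption, since the report does not state which convention is used.

```python
def toy_substitution(x, y):
    # Hypothetical stand-in for a real substitution matrix such as BLOSUM62.
    return 5.0 if x == y else -4.0


def sw_affine_score(a, b, sub, gap_open=10.0, gap_extend=0.5):
    """Return the best local alignment score of a and b (affine gaps)."""
    neg = float("-inf")
    best = 0.0
    h_prev = [0.0] * (len(b) + 1)   # best score ending at (i-1, j)
    f_prev = [neg] * (len(b) + 1)   # best score ending in a vertical gap
    for i in range(1, len(a) + 1):
        h_cur = [0.0] * (len(b) + 1)
        f_cur = [neg] * (len(b) + 1)
        e = neg                     # best score ending in a horizontal gap
        for j in range(1, len(b) + 1):
            # Open a new gap from H, or extend the gap already open.
            e = max(h_cur[j - 1] - gap_open, e - gap_extend)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_extend)
            diag = h_prev[j - 1] + sub(a[i - 1], b[j - 1])
            # Local alignment: a cell never drops below zero.
            h_cur[j] = max(0.0, diag, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


if __name__ == "__main__":
    print(sw_affine_score("AAAACCCC", "AAAATTCCCC", toy_substitution))
```

Under these assumptions a sequence aligned against itself scores the sum of its self-substitution scores, which is the sense in which the report's "Perfect score" is the maximum attainable for the query.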



Database: Issued_Patents_AA:*

1: /cgn2_6/ptodata/2/iaa/5A_COMB.pep:*
2: /cgn2_6/ptodata/2/iaa/5B_COMB.pep:*
3: /cgn2_6/ptodata/2/iaa/6A_COMB.pep:*
4: /cgn2_6/ptodata/2/iaa/6B_COMB.pep:*
5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep:*
6: /cgn2_6/ptodata/2/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
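
Two of the figures in this report can be cross-checked with simple arithmetic. The sketch below assumes that "Query Match" is the hit score expressed as a percentage of the perfect score, and that one "cell update" corresponds to one (query residue, database residue) pair. Neither definition is stated in the report, and the function names are hypothetical, but both assumptions reproduce the printed values.

```python
# Figures copied from the run header and per-result statistics of this report.
PERFECT_SCORE = 4031        # "Perfect score"
QUERY_LENGTH = 761          # residues in the query sequence
DB_RESIDUES = 51_625_971    # "Searched: 389414 seqs, 51625971 residues"
SEARCH_SECONDS = 23         # "Search time 23 Seconds"


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    # Assumption: Query Match = 100 * score / perfect score.
    return 100.0 * score / perfect_score


def million_cell_updates_per_second(query_length, db_residues, seconds):
    # Assumption: one cell update per (query residue, database residue) pair.
    return query_length * db_residues / seconds / 1e6


if __name__ == "__main__":
    for score in (1150.5, 1044, 825.5):
        print(f"score {score}: {query_match_percent(score):.1f}% query match")
    rate = million_cell_updates_per_second(QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS)
    print(f"{rate:.3f} Million cell updates/sec")
```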



SUMMARIES 

                  %
Result          Query
No.     Score   Match   Length  DB  ID  Description

1       1150.5  28.5    607     4
Sequence 2, Appli
Sequence 54, Appl
Sequence 54, Appl
Sequence 54, Appl
54, Appl
54, Appl
Sequence 7, Appli
US-09-077-940A-4 Sequence
US-08-835-268-58 Sequence 58, Appl
US-09-060-692-58 Sequence 58,
US-08-833-391-58 Sequence 58, Appl
58,
Sequence 60,

ALIGNMENTS



RESULT 1 

US-08-556-422A-4 

Sequence 4, Application US/08556422A 
Patent No. 6576754 
GENERAL INFORMATION: 
APPLICANT: HALL, Kathryn T. 
APPLICANT: FREEMAN, Gordon J. 
APPLICANT: SCHULTZE, Joachim L. 
APPLICANT: BOUSSIOTIS, Vassiliki 
APPLICANT: NADLER, Lee M. 

TITLE OF INVENTION: NUCLEIC ACIDS ENCODING CD100 MOLECULES 
FILE REFERENCE: DFN-005CPA2 

CURRENT APPLICATION NUMBER: US/08/556,422A
CURRENT FILING DATE: 1995-11-09 
NUMBER OF SEQ ID NOS: 7 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 4 
LENGTH: 607
TYPE: PRT
ORGANISM: Mus musculus
US-08-556-422A-4 

Query Match 28.5%; Score 1150.5; DB 4; Length 607; 

Best Local Similarity 41.2%; Pred. No. 3.9e-106; 

Matches 254; Conservative 103; Mismatches 215; Indels 45; Gaps 18; 

Qy 49 ERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAI LALDIQDPGVP — RLKNMIPWPAS 106 

II: I : : : : I I I I I I I I I I I I I I I : I I : : I : : : | | 

Db 1 EERLIRKFEAENISNYTALLLSQDGKTLYVGAREALFALNSNLSFLPGGEYQELL-WSAD 59 

QY 107 DRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISE 166 

: I : I : I I I : I I : I : : I : I : II I I I I I I I I I : I : I 
Db 60 ADRKQQCSFKGKDPKRDCQNYIKILLPLNSSHLLTCGTAAFSPLCAYIHIAS FTLAQDEA 119 

Qy 167 DKVM EGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFL 224 

I : : II I I I I I I I I : : I I I I I : I I : : : I I : : I : | : I : I | | : : | 
Db 120 GNVI LEDGKGHCP FD PNFKSTALWDGELYTGTVS S FQGND PAI S RS QS S RPT- KTES S L 178 

Qy 225 RWLHHDASFVAAIPSTQ WYFFFEETASEFDFFERLHTSRVARVCKNDVG 274 

II 1:111: I : : I I I I I I I I : I I I I I I I I I I I I I 

Db 179 NWL-QDPAFVASATSPESLGSPIGDDDKIYFFFSETGQEFEFFENTIVSRVARVCKGDEG 237 

Qy 275 GEKLLQKKWTTFLKAQLLCTQPGQ-LPFNVT RHAVLL PAD S P TAP H I YAVFT S QWQ V 330 

I I : : M : : I I : I I I I I I I I : : I I I I I :: | II | I I I I I I 

Db 238 GERVLQQRWTSFLKAQLLCSRPDDGFPFNVLQDVFTLNPNPQDWRKTLSI-GVFTSQWHR 296 

Qy 331 GGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNPRPGSCSVGPS 384 

II I I : I I : : I : : : I I I I : : I : I I : I I I I I I : I : 

Db 297 GTTEGSAI CVFTMNDVQKAFDGLYKKVNRETQQWYTETHQVPTPRPGACITNSARERKIN 356 

Qy 385 SDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAVETAQGLDGHS-HLVMYLGT 438 

I : I I : I I I I I I I I I ||:: I I : I I II II : I : : I I I 

Db 357 SSLQLPDRVLNFLKDHFLMDGQVRSRLLLLQPRARYQRVAVHRVPGL — HSTYDVLFLGT 414 

Qy 439 TTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCS 498 

I I I I I I I I : : I I : I : I I : I I : I I I I : : I I : I I I I I I 

Db 415 GDGRLHKAVTL-SSRVHIIEELQIFPQGQPVQNLLLDSHGGLLYASSHSGWQVPVANCS 473 

Qy 499 VYE S C VD C VL ARD P H C AWD PESRTCCLL S APN LN S — WKQ DME RGN P E WAC AS G PMS R S L 556 

: I : I I I : I I I I I : I I I : I I : I I I I I : I : : | : 

Db 474 LYPTCGDCLLARDPYCAWTGSACRLASLYQPDLASRPWTQDIEGASVKELCKNSSYKARF 533 

Qy 557 RPQSRPQIIKEVLAVPNSILELPCPHLSA1ASYYWSHGPAAVPEASS — TVYNGSLLLI- 613 

: I hi II:: I I I I I I I : II II : : I : III): 
Db 534 LVPGKP — CKQVQIQPNTWTLACPLLSNLATRLWVHNGAPVNASASCRVLPTGDLLLVG 591 

Qy 614 VQ D GVGGL YQ CWAT EN G 630 

I I: I::|||: I I 
Db 592 SQQGL-GVFQCWSIEEG 607 
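
The "Best Local Similarity" reported for RESULT 1 is consistent with the fraction of alignment columns that are exact matches, where the column count is taken as matches plus conservative substitutions plus mismatches plus indels. That definition is an assumption inferred from the numbers, not something the report states, and the function name is hypothetical. A short check:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    # Assumption: every alignment column falls into exactly one of these classes.
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts copied from the RESULT 1 statistics line.
result_1 = dict(matches=254, conservative=103, mismatches=215, indels=45)

if __name__ == "__main__":
    print(f"{best_local_similarity(**result_1):.1f}%")
```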



RESULT 2 

US-08-556-422A-2 

; Sequence 2, Application US/08556422A
; Patent No. 6576754
; GENERAL INFORMATION:
; APPLICANT: HALL, Kathryn T.
; APPLICANT: FREEMAN, Gordon J.
; APPLICANT: SCHULTZE, Joachim L.
; APPLICANT: BOUSSIOTIS, Vassiliki
; APPLICANT: NADLER, Lee M.
; TITLE OF INVENTION: NUCLEIC ACIDS ENCODING CD100 MOLECULES
; FILE REFERENCE: DFN-005CPA2
; CURRENT APPLICATION NUMBER: US/08/556,422A
; CURRENT FILING DATE: 1995-11-09
; NUMBER OF SEQ ID NOS: 7
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 2
; LENGTH: 862
; TYPE: PRT
; ORGANISM: Homo sapiens
US-08-556-422A-2

Query Match 25.9%; Score 1044; DB 4; Length 862; 

Best Local Similarity 35.2%; Pred. No. 3.4e-95; 

Matches 271; Conservative 115; Mismatches 263; Indels 120; Gaps 27; 

QY 38 PMPRVRYYAGDERRALSF — FHQKGLQDFDTLLLSGDGNTLYVGAREAI LALDIQDPGVP 95 

I • I I : : II: I I : : : : I I I I I = I I I : I I I I I : I : : : : 

Db 26 PIPRITW EHREVHLVQFHEPDIYNYSALLLSEDKDTLYIGAREAVFAVN7VLN — IS 79 

QY 96 RLKNMI PWPAS DRKKSECAFKKKSNETQCFNFI RVLVS YNVTHLYTCGT FAFS PACTFI E 155 

: : : I I : I I : : I I I I I : I : I I : I II I : I I I I I I I I I II : 
Db 80 EKQHEVYWKVS EDKKAKCAEKGKS KQTECLN YI RVLQPLSAT S L YVCGTNAFQPACDHLN 139 

Qy 156 LQDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQ 215 

I I -II I I I : I I I I II : I : I : I I I I I | | ! I I I I I I I I : I 

Db 140 LTSFKFLGKNED GKGRCPFDPAHSYTSVMVDGELYSGTSYNFLGSEPIISRNSSHS 195 

QY 216 PVLKTDNFLRWLHHDASFVAA IPST QWYFFFEETASEFDFFERLHTSRV 2 65 

I 1:1: : || :: Ml I I : Mill I : |::| |: |: 

Db 196 P-LRTEYAI PWL-NEPS FVFADVI RKS PDS PDGEDDRVYFFFTEVSVEYEFVFRVLI PRI 253 

Qy 266 ARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ-LPFNVIRHAVLLPADSPTAPHIYAVF 324 

I I I I I I I I : I I I I I I : I I I I : I : I :: | I I I I : I : I : | | | : | 

Db 254 ARVCKGDQGGLRTLQKKWT S FLKARLI C S RP DS GLVFNVLRDVFVLRS PGLKVPVFYALF 313 

Qy 325 TSQWQVGGTRSSAVCAFSLLDIERVFK-GKYKE LNKETSRWTTYRGPETNPRPGSCS 38 0 

I I : I I I I I :: I III II! : : : : : I IN | | | | : | 

Db 314 TP QLNNVGLSAVCAYNLSTAEEVFSHGKYMQSTTVEQSHTKWVRYNGPVPKPRPGACI 371 

Qy 381 VGPS S D KALT FMKDH FLMDEQ W GTPLLVKS GVEYTRLAVETAQGL 426 

' II I hill Mh I I Ml I IM: M I I 

Db 372 DSEARAANYTSSLNLPDKTLQFVKDHPLMDDSVTPIDNRPRLIKKDVNYTQIWDRTQAL 431 

Qy 427 DGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQG — AVFVG 484 

II : IM: I I : I I I I : I : : I : : I I I I | | | | | : | | : :| |: | 

Db 432 DGSVYDVMFVSTDRGALHKA-ISLEHAVHIIEETQLFQDFEPVQTLLLSSKKGNRFVYAG 490 

Qy 485 FSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMERGNPE 544 

: I I : I I I :: I I I I II II I : I I I I : | | | | | 

Db 491 SNSGWQAPLAFCGKHGTCEDCVLARDPYCAWSPPTATCVALHQTESPSRGLIQEMSGDA 550 



Qy 


545 


Db 


551 


Qy 


605 


Db 


600 


Qy 


657 


Db 


650 


Qy 


710 


Db 


687 



I III III :| I :, | 
PDKSKGSYRQHFFKHGGTAELKCSQKSNLARVFWKF-QNGVLKAESP 599 

-NGSLLLIVQDGVGGLYQCWATE NGFSYPVISYWVDSQDQTLALDPELAGI 656 

I: : :| 1:111 : I I : I : : | I: : 



I : I | : | : | : : | : : | | | | | | 

PKPWAPTLSWQTEGSRIATK VLVASTQGSSPPTPA 686 

710 LRARGKVQGCETLRPGEKAP L S REQHLQ S P KE CRT S AS D 748 

I I I I I : I I : : I I I : | | 



RESULT 3 

US-08-121-713D-54 

Sequence 54, Application US/08121713D 
Patent No. 5639856 
GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L. 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R. 
APPLICANT: O'Connor, Timothy 

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES : 100 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 Bush Street, Suite 3200 
CITY: San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/121,713D
FILING DATE: 13-SEP-1993 
CLASSIFICATION: 514 
ATTORNEY/AGENT INFORMATION: 
NAME: Osman, Richard A. 
REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET NUMBER: B94-002-1
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
TELEX: 

INFORMATION FOR SEQ ID NO: 54: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 771 amino acids 



TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-121-713D-54 



Query Match 20.5%; Score 825.5; DB 1; Length 771; 

Best Local Similarity 30.5%; Pred. No. 2.3e-73; 

Matches 216; Conservative 119; Mismatches 282; Indels 91; Gaps 24 

Qy 35 GQGPMPRVRY-YAG--DERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQD 91

I : : i I : : I : : : I : I I I : : I i I I I : : | : | : : 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFDLVN 84 

Qy 92 PGVPRLKNM--IPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSP 149

: I : Mil : : | | : | : I I I I : I I : II I I II I M I I I 
Db 85 IKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHP 139 

Qy 150 ACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEP 206 

I I : i I : : : • I : I : I I : I I : : I : I I I I I I I : I : I : 

Db 140 ICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199

Qy 207 ILMRTLGSQPVLKTDNF-LRWLHHDASFVAA IPSTQWYFFFEETASEFDFFE 258

: I'll : : | : I I I : I I : : | | I I I I I | | : : 

Db 200 AIFRTLGHHHPIRTEQHDSRWL-NDPKFISAHLISESDNPEDDKVYFFFRENAIDGEHSG 258 

Qy 259 RLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLPFNVIRHAVLLPADS 314

: : I : I I I I I I : I I I I I I I I I : I : I : I I | : : : | : 

Db 259 KATHARIGQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNGIDTHFDELQDVFLMNFKD 318

QY 315 PTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNP 374 

I I : I I I I : : : I I I I : I : I : I I I | | : : | | : | | 
Db 319 PKNPWYGVFTTSSNI--FKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPYQGRVPYP 376

Qy 375 RPGSC SVGPSSDKALTFMKDHFLMDEQW GTPLLVKSGVEY--TRLAV 420

111=1 I I :|| : | | | I:::|: I I |:: I 

Db 377 RPGTCPSKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVIKTDVNYQFTQIW 436

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAV VSGDSSAHLVEEIQLFPDPEPVRNLQLA 475 

: II : M::ll h: I I I I : I I : : I : I : : : | : 

Db 437 DRVDAEDG-QYDVMFIGTDVGTVLKWSIPKETWYDLEEVLLEEMTVFREPTAISAMELS 495

Qy 476 PTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPNL--N 532

I : : : I : I I : : I I : I : : | : | | | | | | : | | | | : | | 
Db 496 TKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWD GSACSRYFPTAKRR 552

Qy 533 SWKQDMERGNPEWACAS--GPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYY 590

: Ml: 1:1 I: |: I : I II I I I I I 

Db 553 TRRQDIRNGDPLTHCSDLHHDNHHGHSPEER IIYGVENSSTFLECSPKSQRALVY 607 

Qy 591 WS HGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDS 642 

I I : I I I : I I I I I I : I I 
Db 608 WQFQRRNEERKEEIRVDDHIIRTDQGLLLRSLQQKDSGNYLCHAVEHGFI 657

Qy 643 QDQTLALDPELAGIPREHVKVPLTRVSGG AALAAQQSYW 681

I I I I I I I I :: | : | :: | , 

Db 658 QTL-LKVTLEVIDTEHLEELLHKDDDGDGSKTKEMSNSMTPSQKVW 702 



RESULT 4 

US-08-835-268-54 

Sequence 54, Application US/08835268 
Patent No. 5807826 
GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L. 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R. 
APPLICANT: O'Connor, Timothy 

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 100 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 Bush Street, Suite 3200 
CITY: San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/835,268
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/08/121,713
FILING DATE: 13-SEP-1993
ATTORNEY/AGENT INFORMATION:
NAME: Osman, Richard A.
REGISTRATION NUMBER: 36,627
REFERENCE/DOCKET NUMBER: B94-002-1
TELECOMMUNICATION INFORMATION:
TELEPHONE: (415) 343-4341
TELEFAX: (415) 343-4342 
TELEX: 

INFORMATION FOR SEQ ID NO: 54: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 771 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-835-268-54 



Query Match 20.5%; Score 825.5; DB 1; Length 771; 

Best Local Similarity 30.5%; Pred. No. 2.3e-73; 

Matches 216; Conservative 119; Mismatches 282; Indels 91; Gaps 24 

Qy 35 GQGPMPRVRY-YAG--DERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQD 91

I : : I I : : I : : : | : I I I : : I I I I I : : I : I : : 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFDLVN 84 



Qy 92 PGVPRLKNM--IPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSP 149



= 1: I M I :: II : I : I I I I : I I : I I I I I I I I I | | | 
Db 85 IKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHP 139 

Qy 150 ACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEP 206 

I I : I I : : = : I : I : I i : I I : : I : | | ( I I I I : I : | : 

Db 140 ICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 

Qy 207 ILMRTLGSQPVLKTDNF-LRWLHHDASFVAA IPSTQWYFFFEETASEFDFFE 258

• MM ::h Ml :| MM | I I I II I I : : 

Db 200 AIFRTLGHHHPIRTEQHDSRWL-NDPKFISAHLISESDNPEDDKVYFFFRENAIDGEHSG 258

Qy 259 RLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLPFNVIRHAVLLPADS 314

= M : : M M I I I : I II I M II I : I : I : I I I : : : I : 

Db 259 KATHARIGQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNGIDTHFDELQDVFLMNFKD 318 

Qy 315 PTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNP 374 

1 I M MM : : I I I I M : I : M I I I : : I I : | | 
Db 319 PKNPWYGVFTTSSNI--FKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPYQGRVPYP 376

Qy 375 RPGSC SVGPSSDKALTFMKDHFLMDEQW GTPLLVKSGVEY--TRLAV 420

I I M I I | : | | : | | | | : : : | : | | | : : | 

Db 377 RPGTCPSKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVIKTDVNYQFTQIW 436 

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAV VSGDSSAHLVEEIQLFPDPEPVRNLQLA 475 

• M : I M : I I I : : I I | I : I I : : I : I : : : I : 

Db 437 DRVDAEDG-QYDVMFIGTDVGTVLKWSIPKETWYDLEEVLLEEMTVFREPTAISAMELS 495

Qy 476 PTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPNL--N 532

1 : : : I : I I : : I I M : : I : I II I I M I I I I : I I 
Db 496 TKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWD GSACSRYFPTAKRR 552

Qy 533 SWKQDMERGNPEWACAS--GPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYY 590

: MM Ml M I : I : I I I II III 

Db 553 TRRQDIRNGDPLTHCSDLHHDNHHGHSPEER IIYGVENSSTFLECSPKSQRALVY 607

Qy 591 WS HGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDS 642 

I I : III M I I I I | : || 
Db 608 WQFQRRNEERKEEIRVDDHIIRTDQGLLLRSLQQKDSGNYLCHAVEHGFI 657 

Qy 643 QDQTLALDPELAGIPREHVKVPLTRVSGG AALAAQQSYW 681

I II I I I | | :: | : | :: | | 

Db 658 --QTL-LKVTLEVIDTEHLEELLHKDDDGDGSKTKEMSNSMTPSQKVW 702



RESULT 5 

US-09-060-692-54 

; Sequence 54, Application US/09060692 
; Patent No. 5935865 
; GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 

APPLICANT: Kolodkin, Alex L. 

APPLICANT: Matthes, David 

APPLICANT: Bentley, David R. 
; APPLICANT: O'Connor, Timothy

; TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 100 
CORRESPONDENCE ADDRESS: 



; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 Bush Street, Suite 3200 

CITY: San Francisco 
; STATE: CA

; COUNTRY: USA 

ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/060,692 

FILING DATE: 

CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/08/121,713

FILING DATE: 13-SEP-1993 
ATTORNEY/AGENT INFORMATION: 
; NAME: Osman, Richard A. 

; REGISTRATION NUMBER: 36,627 

REFERENCE/DOCKET NUMBER: B94-002-1 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (415)343-4341 

TELEFAX: (415) 343-4342 

TELEX:

; INFORMATION FOR SEQ ID NO: 54: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 771 amino acids 

TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-060-692-54 

Query Match 20.5%; Score 825.5; DB 2; Length 771; 

Best Local Similarity 30.5%; Pred. No. 2.3e-73; 

Matches 216; Conservative 119; Mismatches 282; Indels 91; Gaps 24; 

Qy 35 GQGPMPRVRY-YAG--DERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQD 91

I : HI:: I : : : I : I I I : : I I I I I : : I : I : : 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFDLVN 84

QY 92 PGVPRLKNM--IPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSP 149 

•I: I II I :: II : I : I M I : I I : I I I I I I I I I I I I 
Db 85 IKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHP 139 

QY 150 ACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEP 206 

I I : I I : : : : I : I : I I : I I : : I : I I I I I I | : I : I : 

Db 140 ICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 

Qy 207 ILMRTLGSQPVLKTDNF-LRWLHHDASFVAA IPSTQWYFFFEETASEFDFFE 258

: I I I I : : I : III : I I : : I I I I I I I I I : : 

Db 200 AIFRTLGHHHPIRTEQHDSRWL-NDPKFISAHLISESDNPEDDKVYFFFRENAIDGEHSG 258 

Qy 259 RLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLPFNVIRHAVLLPADS 314 

= : I : : : I I I I I I : I I I I I I I I I : I : I : I | | : : : | : 

Db 259 KATHARIGQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNGIDTHFDELQDVFLMNFKD 318



Qy 315 PTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNP 374 

I I : I I I I : : : I I I I : I : h I I I I I : : I I : I | 
Db 319 PKNPWYGVFTTSSNI--FKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPYQGRVPYP 376

Qy 375 RPGSC SVGPSSDKALTFMKDHFLMDEQW GTPLLVKSGVEY--TRLAV 420

I I I - I I I :M : I I I |:::|: | | |:: | 

Db 377 RPGTCPSKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVIKTDVNYQFTQIW 436

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAV VSGDSSAHLVEEIQLFPDPEPVRNLQLA 475 

: II : I I : : I I I : : I I I I : I I : : I : | : : : I : 

Db 437 DRVDAEDG-QYDVMFIGTDVGTVLKWSIPKETWYDLEEVLLEEMTVFREPTAISAMELS 495

Qy 476 PTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPNL--N 532 

I : : : I : I I : : i I : I : : I : I I I I I I : i I I I : | | 
Db 496 TKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWD GSACSRYFPTAKRR 552

Qy 533 SWKQDMERGNPEWACAS--GPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYY 590

: : I I : I : I I : I : I : I I I II III 

Db 553 TRRQDIRNGDPLTHCSDLHHDNHHGHSPEER IIYGVENSSTFLECSPKSQRALVY 607 

Qy 591 WS — HGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDS 642 

I I : III : I 11111:1! 
Db 608 WQFQRRNEERKEEIRVDDHIIRTDQGLLLRSLQQKDSGNYLCHAVEHGFI 657 

Qy 643 QDQTLALDPELAGIPREHVKVPLTRVSGG AALAAQQSYW 681

MM I I II:: I : I :: II 

Db 658 --QTL-LKVTLEVIDTEHLEELLHKDDDGDGSKTKEMSNSMTPSQKVW 702



RESULT 6 

US-08-833-391-54 

Sequence 54, Application US/08833391 
Patent No. 6013781 
GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L. 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R. 
APPLICANT: O'Connor, Timothy 

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 100 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 Bush Street, Suite 3200 
CITY: San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/833,391
FILING DATE: 



CLASSIFICATION: 530 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/121,713 
FILING DATE: 13-SEP-1993 
; ATTORNEY/AGENT INFORMATION:
NAME: Osman, Richard A. 
REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET NUMBER: B94-002-1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415)343-4341 
TELEFAX: (415) 343-4342 
TELEX:

; INFORMATION FOR SEQ ID NO: 54: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 771 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-391-54 



Query Match 20.5%; Score 825.5; DB 3; Length 771; 

Best Local Similarity 30.5%; Pred. No. 2.3e-73; 

Matches 216; Conservative 119; Mismatches 282; Indels 91; Gaps 24 

Qy 35 GQGPMPRVRY-YAG--DERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQD 91

I : : I I : : I : : : | : I I I : : I I I I I : : I : | : : 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFDLVN 84 

Qy 92 PGVPRLKNM--IPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSP 149

: I : 1 I I I :: I I : I : I I I I : I I : I I I II I I I I I I I 
Db 85 IKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHP 139

QY 150 ACTFIEL— QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEP 206 

N : I I : : : : I : I : I I : I I : : I : I I I | | | | : I : I : 

Db 140 ICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 

Qy 207 ILMRTLGSQPVLKTDNF-LRWLHHDASFVAA IPSTQWYFFFEETASEFDFFE 258

: I I I I : : I : I I I : I | : : | | I I I I I I | : : 

Db 200 AIFRTLGHHHPIRTEQHDSRWL-NDPKFISAHLISESDNPEDDKVYFFFRENAIDGEHSG 258

QY 259 RLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLPFNVIRHAVLLPADS 314 

• : I : : : I I I I I I : I I I I I I I I I : I : I : I I | : : : | : 

Db 259 KATHARIGQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNGIDTHFDELQDVFLMNFKD 318 

QY 315 PTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNP 374 

I I : I I I I : : : I I I I : I : I : I I I | | : : | | : | | 
Db 319 PKNPWYGVFTTSSNI--FKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPYQGRVPYP 376

Qy 375 RPGSC SVGPSSDKALTFMKDHFLMDEQW GTPLLVKSGVEY--TRLAV 420

I I I : I I | : | | : | | | | : :: | : | | | :: | 

Db 377 RPGTCPSKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVIKTDVNYQFTQIW 436

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAV VSGDSSAHLVEEIQLFPDPEPVRNLQLA 475

: II : ll::|| I:: I I I I : I I : : | : | : : : | : 

Db 437 DRVDAEDG-QYDVMFIGTDVGTVLKWSIPKETWYDLEEVLLEEMTVFREPTAISAMELS 495



Qy 476 PTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPNL--N 532



I : : : I : M : : I I : I : : I : I I I I I I : I I I I : I | 
Db 496 TKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWD GSACSRYFPTAKRR 552

Qy 533 SWKQDMERGNPEWACAS--GPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYY 590

: : I I : I : I I : | : | : I I I I I III 

Db 553 TRRQDIRNGDPLTHCSDLHHDNHHGHSPEER IIYGVENSSTFLECSPKSQRALVY 607

QY 591 WS HGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDS 642 

! I : III - I Mllhll 
Db 608 WQFQRRNEERKEEIRVDDHIIRTDQGLLLRSLQQKDSGNYLCHAVEHGFI 657 

Qy 643 QDQTLALDPELAGIPREHVKVPLTRVSGG AALAAQQSYW 681

M I I I I I I : : I : I : : | | 

Db 658 --QTL-LKVTLEVIDTEHLEELLHKDDDGDGSKTKEMSNSMTPSQKVW 702

RESULT 7 

US-09-060-610-54 

Sequence 54, Application US/09060610 
Patent No. 6344544 
GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L. 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R.
APPLICANT: O'Connor, Timothy

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 100
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 Bush Street, Suite 3200 
CITY: San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/060,610 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/835,268 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
NAME: Osman, Richard A. 
REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET NUMBER: B94-002-1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341
TELEFAX: (415) 343-4342
TELEX:

INFORMATION FOR SEQ ID NO: 54: 
SEQUENCE CHARACTERISTICS: 



LENGTH: 771 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-060-610-54 

Query Match 20.5%; Score 825.5; DB 4; Length 771; 

Best Local Similarity 30.5%; Pred. No. 2.3e-73; 

Matches 216; Conservative 119; Mismatches 282; Indels 91; Gaps 24 

Qy 35 GQGPMPRVRY-YAG--DERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQD 91

I : : I I : : I : : : I : I I I : : I I I I I : : I : | : : 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFDLVN 84

Qy 92 PGVPRLKNM--IPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSP 149

: I : I I I I :: I I : I : I I I I : I I : I I I I I I I I I | | | 

Db 85 IKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHP 139 

Qy 150 ACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEP 206

I I : I I : : : : I : I : I I : I I : : I : I I I I I I I : I : I : 

Db 140 ICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 

Qy 207 ILMRTLGSQPVLKTDNF-LRWLHHDASFVAA IPSTQWYFFFEETASEFDFFE 258 

: I I I I :: I : III : I I :: I I I I I I I I I : : 

Db 200 AIFRTLGHHHPIRTEQHDSRWL-NDPKFISAHLISESDNPEDDKVYFFFRENAIDGEHSG 258 

Qy 259 RLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLPFNVIRHAVLLPADS 314

: : I : : : | | I I I I : I II I I I I I I : I : I : I I I : : : I : 

Db 259 KATHARIGQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNGIDTHFDELQDVFLMNFKD 318 

Qy 315 PTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNP 374 

I I ♦ I III: : : ( | | I : i : I : I I I I I : : I I : I I 
Db 319 PKNPWYGVFTTSSNI--FKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPYQGRVPYP 376

Qy 375 RPGSC SVGPSSDKALTFMKDHFLMDEQW GTPLLVKSGVEY--TRLAV 420

I I I : I I I : I I : I I I | : :: | : | | | : : | 

Db 377 RPGTCPSKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVIKTDVNYQFTQIW 436

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAV VSGDSSAHLVEEIQLFPDPEPVRNLQLA 475

: II : I I : : I I I : : I I I I : I I : : I : I : : : I : 

Db 437 DRVDAEDG-QYDVMFIGTDVGTVLKWSIPKETWYDLEEVLLEEMTVFREPTAISAMELS 495

Qy 476 PTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPNL--N 532

I : : : I : I I : : | I : I : : I : I I I I I I : I I I I : I I 
Db 496 TKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWD GSACSRYFPTAKRR 552 

Qy 533 SWKQDMERGNPEWACAS--GPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYY 590

: : I I : I : I I : I : I : I I I II III 

Db 553 TRRQDIRNGDPLTHCSDLHHDNHHGHSPEER IIYGVENSSTFLECSPKSQRALVY 607 

Qy 591 WS HGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDS 642 

I I : I I I : I I I I I I : I I 
Db 608 WQFQRRNEERKEEIRVDDHIIRTDQGLLLRSLQQKDSGNYLCHAVEHGFI 657

Qy 643 QDQTLALDPELAGIPREHVKVPLTRVSGG AALAAQQSYW 681

I I I I I I I I : : I : I : : | | 

Db 658 --QTL-LKVTLEVIDTEHLEELLHKDDDGDGSKTKEMSNSMTPSQKVW 702



RESULT 8 

PCT-US94-10151A-54 

; Sequence 54, Application PC/TUS9410151A
; GENERAL INFORMATION: 

APPLICANT: The Regents of the University of California 
TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 66 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: FLEHR HOHBACH TEST ALBRITTON & HERBERT 
STREET: 4 Embarcadero Center, Suite 3400 
; CITY: San Francisco 

; STATE: CA 

COUNTRY: USA 
ZIP: 94111-4187 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US94/10151A

FILING DATE: 13-SEP-1994 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION:
; NAME: Osman, Richard A. 

REGISTRATION NUMBER: 36,627 

REFERENCE/DOCKET NUMBER: FP-58750-PC/RAO
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 781-1989 

TELEFAX: (415) 398-3249 

TELEX: 910 277299 FHT UR 
; INFORMATION FOR SEQ ID NO: 54: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 771 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
PCT-US94-10151A-54 



Query Match 20.5%; Score 825.5; DB 5; Length 771; 

Best Local Similarity 30.5%; Pred. No. 2.3e-73; 

Matches 216; Conservative 119; Mismatches 282; Indels 91; Gaps 24 

Qy 35 GQGPMPRVRY-YAG--DERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQD 91
I : I : : : I : | | | : : | | | | | : : | : | : : 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFDLVN 84 

Qy 92 PGVPRLKNM--IPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSP 149

: I : I I I I : : I I : I : I I I I : I I : I i I I I I I I I I I I 
Db 85 IKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHP 139

Qy 150 ACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEP 206 

11=11: : : : |:|:||:|| ::|:|| ||||| : | : | : 

Db 140 ICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 



Qy 207 ILMRTLGSQPVLKTDNF-LRWLHHDASFVAA IPSTQWYFFFEETASEFDFFE 258

: 1 1 1 1 : : I : | | | : | | : : | | Mill 1 1 : :

Db 200 AIFRTLGHHHPIRTEQHDSRWL-NDPKFISAHLISESDNPEDDKVYFFFRENAIDGEHSG 258

Qy 259 RLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLPFNVIRHAVLLPADS 314

: : 1 : : : 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 : 1 : I : || | : :: | :

Db 259 KATHARIGQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNGIDTHFDELQDVFLMNFKD 318

Qy 315 PTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNP 374

1 1 : 1 1 1 1 : : : 1 1 1 1 : 1 : I : I I I | [ : : | | : | |

Db 319 PKNPWYGVFTTSSNI--FKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPYQGRVPYP 376

Qy 375 RPGSC SVGPSSDKALTFMKDHFLMDEQW GTPLLVKSGVEY--TRLAV 420

"hi 1 1 :H : I I 1 |:::|: | | |:; |

Db 377 RPGTCPSKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVIKTDVNYQFTQIW 436

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAV VSGDSSAHLVEEIQLFPDPEPVRNLQLA 475

: I 1 : 1 1 ? * 1 1 I • • i i i i . i i . . i . i . . i * i i • i i i i i • i • : : | :

Db 437 DRVDAEDG-QYDVMFIGTDVGTVLKWSIPKETWYDLEEVLLEEMTVFREPTAISAMELS 495

Qy 476 PTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPNL--N 532

1 : II ::| | : | : : | : | | | | | | : | | | | . | |

Db 496 TKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWD GSACSRYFPTAKRR 552

Qy 533 SWKQDMERGNPEWACAS--GPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYY 590

: :ll: hi |: |: | : 1 II 1 1 1 I |

Db 553 TRRQDIRNGDPLTHCSDLHHDNHHGHSPEER IIYGVENSSTFLECSPKSQRALVY 607

Qy 591 WS HGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDS 642

1 1 : 1 1 1 : 1 1 1 1 1 h 1 1

Db 608 WQFQRRNEERKEEIRVDDHIIRTDQGLLLRSLQQKDSGNYLCHAVEHGFI 657

Qy 643 QDQTLALDPELAGIPREHVKVPLTRVSGG AALAAQQSYW 681

INI 1 I |h: 1 : | :: | 1

Db 658 --QTL-LKVTLEVIDTEHLEELLHKDDDGDGSKTKEMSNSMTPSQKVW 702





RESULT 9 

US-08-556-422A-3 

; Sequence 3, Application US/08556422A 

; Patent No. 6576754 

; GENERAL INFORMATION: 

; APPLICANT: HALL, Kathryn T. 

; APPLICANT: FREEMAN, Gordon J. 

; APPLICANT: SCHULTZE, Joachim L.

; APPLICANT: BOUSSIOTIS, Vassiliki 

; APPLICANT: NADLER, Lee M. 

; TITLE OF INVENTION: NUCLEIC ACIDS ENCODING CD100 MOLECULES 
; FILE REFERENCE: DFN-005CPA2 

; CURRENT APPLICATION NUMBER: US/08/556,422A
; CURRENT FILING DATE: 1995-11-09 
; NUMBER OF SEQ ID NOS: 7

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 3 

LENGTH: 655 
; TYPE: PRT 

ORGANISM: Homo sapiens 



US-08-556-422A-3 

Query Match 20.2%; Score 815.5; DB 4; Length 655; 

Best Local Similarity 31.3%; Pred. No. 1.8e-72; 

Matches 203; Conservative 114; Mismatches 262; Indels 69; Gaps 21 

Qy 35 GQGPMPRVRY-YAG--DERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQD 91

I : : I I : : I : : : I : I I I : : I I I I I : : I : I : : 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFDLVN 84 

Qy 92 PGVPRLKNM--IPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSP 149

: I : I I I I : : I I : I : I I I I : I I : I I I I I I I ! I I I I 
Db 85 IKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHP 139

Qy 150 ACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEP 206 

I I : I I : : : : I : I : I I : I I : : I : I I I I ! I I : I : I : 

Db 140 ICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 

Qy 207 ILMRTLGSQPVLKTDNF-LRWLHHDASFVAA IPSTQWYFFFEETASEFDFFE 258

: I I I I :: I : I I I : I I :: I I I I I I I I I : : 

Db 200 AIFRTLGHHHPIRTEQHDSRWL-NDPKFISAHLISESDNPEDDKVYFFFRENAIDGEHSG 258 

Qy 259 RLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLPFNVIRHAVLLPADS 314

: : I : : : | | | | | | : | I I I I I I I I : I : I : I I I : : : | : 

Db 259 KATHARIGQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNGIDTHFDELQDVFLMNFKD 318

Qy 315 PTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNP 374 

I I : I I I I : : : I I I I : I : I : I I I I I : : I hi I 
Db 319 PKNPWYGVFTTSSNI--FKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPYQGRVPYP 376

Qy 375 RPGSC SVGPSSDKALTFMKDHFLMDEQW GTPLLVKSGVEY--TRLAV 420

I I I : I I I : I I : I I I I : : : I : I I I : : I 

Db 377 RPGTCPSKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVIKTDVNYQFTQIW 436

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAV VSGDSSAHLVEEIQLFPDPEPVRNLQLA 475 

: II : I I : : I I I :: I I I I : I I : : I : I : : : I : 

Db 437 DRVDAEDG-QYDVMFIGTDVGTVLKWSIPKETWYDLEEVLLEEMTVFREPTAISAMELS 495

Qy 476 PTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPNL--N 532

I ::: I : I i :: I I : I : : I : I I I I I I : I I I I : I I 
Db 496 TKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWD GSACSRYFPTAKRR 552 

Qy 533 SWKQDMERGNPEWACAS--GPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYY 590

: : I I : I : I I : I : I : I I I I I III 

Db 553 TRRQDIRNGDPLTHCSDLHHDNHHGHSPEER IIYGVENSSTFLECSPKSQRALVY 607 

Qy 591 WS HGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENG 630 

I I : I I I : I llllhl 

Db 608 WQFQRRNEERKEEIRVDDHIIRTDQGLLLRSLQQKDSGNYLCHAVEHG 655 

RESULT 10 
US-09-308-179B-1 

; Sequence 1, Application US/09308179B 

; Patent No. 6436669 

; GENERAL INFORMATION: 

; APPLICANT: INAGAKI, Shinobu 



; APPLICANT: FURUYAMA, Tatsuo 

; TITLE OF INVENTION: NOVEL SEMAPHORIN GENES (I) 
; FILE REFERENCE: 0020-4562P 

; CURRENT APPLICATION NUMBER: US/09/308,179B
; CURRENT FILING DATE: 1999-05-14 
; PRIOR APPLICATION NUMBER: PCT/JP97/04111
; PRIOR FILING DATE: 1997-11-12 

; PRIOR APPLICATION NUMBER: JAPAN 321068/1996 

; PRIOR FILING DATE: 1996-11-15 

; NUMBER OF SEQ ID NOS: 4

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 1 

LENGTH: 775 

TYPE: PRT 

ORGANISM: Mus sp. 
US-09-308-179B-1 



Query Match 17.6%; Score 709.5; DB 4; Length 775; 

Best Local Similarity 29.4%; Pred. No. 1e-61;

Matches 214; Conservative 120; Mismatches 290; Indels 105; Gaps 31; 

Qy 13 LLGLFLF-QLLQLLLPTTTAGGGGQGPMPRVRYYAGD--ERRALSFFHQK-GLQDFDTLL 68

si I I: 11:1 I I 11:1 : I I I I I |:| 

Db 7 ILTLLLWGHLLELWTP GHSANPSYPRLRLSHKELLELNRTSIFQSPLGFLDLHTML 62 

Qy 69 LSGDGNTLYVGAREAILALDIQDPGVPRLKN MIPWPASDRKKSECAFKKKSNETQCF 125

I I : I I | : : : | : : : | : : I | | : : I | | | | : : | 

Db 63 LDEYQERLFVGGRDLVYSLNLE RVS DG YREI YWP STAVKVEECIMKGK- DANECA 116 

QY 126 NFIRVLVSYNVTHLYTCGTFAFSPACTFIEL— QDSYLLPISEDKVMEGKGQSPFDPAH 182 

I : M M I I I I I I I I I I I I I I : : I : : 1 : I : I I I I 

Db 117 NYIRVLHHYNRTHLLTCATGAFDPHCAFIRVGHHSEEPLFHLESHRSERGRGRCPFDPNS 176

Qy 183 KHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAA--IPST 240

ill I : : I : : : | : : | : : | : : | : : | s I I : I | 
Db 177 SFVSTLVGNELFAGLYSDYWGRDSAIFRSMGKLGHIRTEHDDERLLKEPKFVGSYMIPDN 236

Qy 241 Q WYFFFEETASEFDFFERLHT--SRVARVCKNDVGGEKLLQKKWTTFLKAQLLC 293 

: : I I II I i I : II : | | | : | | | : | | : : : | II : M I I I : I : I 

Db 237 EDRDDNKMYFFFTEKALEAE--NNAHTIYTRVGRLCVNDMGGQRILVNKWSTFLKARLVC 294

Qy 294 TQPG QLPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERV 349

: I I I : : III I I I : : 1 : : | ||| : : | 

Db 295 SVPGMNGIDTYFDELEDVFLLPTRDPKNPVIFGLFNTTSNI--FRGHAVCVYHMSSIREA 352

Qy 350 FKGKYKELNKETSRWTTYRGPETNPRPGSCS VGPSS DKALTFMKDHFLMD 399 

III I : I I I I I I I I : | : I I : I : I I I 

Db 353 FNGPYAHKEGPEYHWSLYEGKVPYPRPGSCASKVNGGKYGTTKDYPDDAIRFARMHPLMY 412 

Qy 400 EQWGT PLLVKSGVEYT — RLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSA 454 

: : hill: : I : I I I : : II : I : : : M II : I : : 

Db 413 QPIKPVHKKPILVKTDGKYNLRQLAVDRVEAEDG-QYDVLFIGTDTGIVLKVITIYNQET 471

Qy 455 H LVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYES-CVDCVL 508 

: : I I : I : I I I I : : : : : : : : : : | : | : | : I : I I I I I I 
Db 472 EWMEEVILEELQIFKDPAPIISMEISSKRQQLYIGSASAVAQVRFHHCDMYGSACADCCL 531



Qy 509 ARDPHCAWDPESRTCCLLSAP NLNSWKQDMERGNPEWAC ASGPMSRSLRP 558 

I I I I : I I I I I I I : I I : | I I : I : 
Db 532 ARDPYCAWDGIS CSRYYPTGAHAKRRFRRQDVRHGNAAQQCFGQQFVGDALDRT 585 

Qy 559 QSRPQIIKEVLAV PNSILELPCPHLSALASYYW — SHGPAAVPEASST VYN 607 

: I I I I I I I I I I I I II : 

Db 586 EERLAYGIESNSTL-LECTPRSLQAKVIWFVQKGRDVRKEEVKTDDRWKMDL 637

Qy 608 GSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREHV 661

Mil: III I M M : : : : : : I : I 

Db 638 GLLFLRVRKSDAGTYFCQTVEHNFVHTVRKITLE WEEHKVEGMFHKDHEEERHH 692 

Qy 662 KVPLTRVSG 670 

I : I : I I 
Db 693 KMPCPPLSG 701 



RESULT 11 
US-08-556-422A-7 

Sequence 7, Application US/08556422A 
Patent No. 6576754 
GENERAL INFORMATION: 
APPLICANT: HALL, Kathryn T. 
APPLICANT: FREEMAN, Gordon J. 
APPLICANT: SCHULTZE, Joachim L. 
APPLICANT: BOUSSIOTIS, Vassiliki 
APPLICANT: NADLER, Lee M. 

TITLE OF INVENTION: NUCLEIC ACIDS ENCODING CD100 MOLECULES 
FILE REFERENCE: DFN-005CPA2 

CURRENT APPLICATION NUMBER: US/08/556,422A
CURRENT FILING DATE: 1995-11-09 
NUMBER OF SEQ ID NOS: 7

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 7 
LENGTH: 425 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-08-556-422A-7 

Query Match 17.3%; Score 699; DB 4; Length 425; 

Best Local Similarity 38.7%; Pred. No. 4.1e-61; 

Matches 167; Conservative 60; Mismatches 159; Indels 46; Gaps 13; 

Qy 191 GMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAA IPST 240

II I I I I MINIM: | | 1 : | : : II : : Ml I I : 

Db 1 GELYSGTSYNFLGSEPIISRNSSHSP-LRTEYAIPWL-NEPSFVFADVIRKSPDSPDGED 58

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ-L 299 

Mill I : I :: I I : I : I I I I I I II : I I II M : I M I : I : I : : I I 
Db 59 DRVYFFFTEVSVEYEFVFRVLIPRIARVCKGDQGGLRTLQKKWTSFLKARLICSRPDSGL 118 

Qy 300 PFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFK-GKYKE-- 356

II I M : I : I I I: II I : I I I M : : I II I MM 

Db 119 VFNVLRDVFVLRSPGLKVPVFYALFTP--QLNNVGLSAVCAYNLSTAEEVFSHGKYMQST 176

Qy 357 -LNKETSRWTTYRGPETNPRPGSCSVGPS SDKALTFMKDHFLMDEQW- 403

• • ' s I Mi II I I : I : II I I : I I II I I M 



Db 177 TVEQSHTKWVRYNGPVPKPRPGACIDSEARAANYTSSLNLPDKTLQFVKDHPLMDDSVTP 236



Qy 404 --GTPLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQ 461

I I : I I I I : : I : I I I I : I I : : I I : I I I I : | : : | : : | I | 
Db 237 IDNRPRLIKKDVNYTQIWDRTQALDGTVYDVMFVSTDRGALHKA-ISLEHAVHIIEETQ 295

Qy 462 LFPDPEPVRNLQLAPTQG--AVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPE 519

I I I I I I : I I : : | I : I : I I : I I I : : | I I I I I | | I : | | | I 
Db 296 LFQDFEPVQTLLLSSKKGNRFVYAGSNSGWQAPLAFCGKHGTCEDCVLARDPYCAWSPP 355 

Qy 520 SRTCCLLSAPNLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELP 579 

sill II I I : : M 

Db 356 TATCVALHQTESPSRGLIQEMSGDASVC PDKSKGSYRQHFFKHGGTAELK 405 

Qy 580 CPHLSALASYYW 591 

I I I I : I 
Db 406 CSQKSNLARVFW 417 



RESULT 12 
US-09-077-940A-4 

; Sequence 4, Application US/09077940A 

; Patent No. 6576441 

; GENERAL INFORMATION: 

; APPLICANT: KIMURA, Toru et al.

; TITLE OF INVENTION: NOVEL SEMAPHORIN Z AND GENE ENCODING THE SAME 
; FILE REFERENCE: 0020-4426P 

; CURRENT APPLICATION NUMBER: US/09/077,940A

; CURRENT FILING DATE: 1998-06-05 

; NUMBER OF SEQ ID NOS: 20

; SOFTWARE: PatentIn version 3.1

; SEQ ID NO 4 

LENGTH: 888

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-077-940A-4 



Query Match 16.7%; Score 673; DB 4; Length 888; 

Best Local Similarity 26.9%; Pred. No. 5.8e-58; 

Matches 210; Conservative 117; Mismatches 225; Indels 228; Gaps 32; 

Qy 21 LLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQDFDTLLLSGDG 73 

M I I I I I I I :: || : | : : | | | 

Db 15 LLLLLL GGAHGLFP EDPPPLSVAPRDYLNHYPVFVGSGPGRLTPAEG 61 

Qy 74 NTL YVGAREAI LALDIQDPGVPRLK — NMI PWPAS DRKKSECAFKKK 118 

M::| I: : :::: I I: : I :: : I I I 
Db 62 ADDLNIQRVLRVNRTLFIGDRDNLYRVELEPPTSTELRYQRKLTWRSNPSDINVCRMKGK 121 

Qy 119 SNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPF 178

I : I I I : : I I : : : I : | | : | | : | | : I | : : : : | : | : 
Db 122 -QEGECRNFVKVLLLRDESTLFVCGSNAFNPVCANYSIDT--LQPVGDN--ISGMARCPY 176

Qy 179 DPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDAS 232 

111:1: MM:: I: :|| : :: |:M :| hi : ||: 

Db 177 D P KH AN VAL F S D GML FT AT VT D FLAI D AVI YRSLGDRPTLRT VKHDS KWFKEP Y 230 



Qy 233 FVAAIPSTQWYFFFEETASEFDFFERLHTSRVARVCKNDVGGE-KLLQKKWTTFLKAQL 291 

III" I I i I I I I I I : : I : : I I I I II I I I I I I I : : I : I : M : II I I : I 

Db 231 FVHAVEWGSHVYFFFREIAMEFNYLEKVWSRVARVCKNDVGGSPRVLEKQWTSFLKARL 290

Qy 292 LCTQPG QLPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERV 349

1=11 I I I :: I I | : | M :: I I I I I I I : I 

Db 291 NCSVPGDSHFYFNVLQ-AVTGWSLGGRPVVLAVFST — PSNSIPGSAVCAFDLTQVAAV 347 

QY 350 FKGKYKELNKETSRWTTYRGPE TNPRPGSCSV GPSS DKALTFMKDHFLM 398 

I : I ::: I IN II Mill: II | I I : I | I I 

Db 348 FEGRFREQKS PES IWTPV — PEDQVPRPRPGCCAAPGMQYNAS SALPDDI LNFVKTHPLM 405 

Qy 399 DEQWG TPLLVKSGVEY — TRLAVETAQGLDGHSHLYMYLGTTTGSLHKAWSGDSS 453 

Ml I : : : : : : I I : I I : | | : I : : I I : I : : I : I : : | 

Db 406 DEAVPSLGHAPWILRTLMRHQLTRVAVDVGAGPWG-NQTWFLGSEAGTVLKFLVRPNAS 464 

Qy 454 AH LVEEIQLF-PD P E P VRN LQL AP T Q GAVFVG F S GG WRVP RA 495 

: I I : : I I : : : I : | | : | Mill 

Db 465 TSGTSGLSVFLEEFETYRPDRCGRPGGGETGQRLLSLELDAASGGLLAAFPRCWRVPVA 524 

Qy 496 NCSVYESCV-DCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDME 539 

I I I : : I : : : I I : I I I : : I II : : : I I : 

Db 525 RCQQYSGCMKNCIGSQDPYCGWAPDG-SCIFLSPGTRAAFEQDVSGASTSGLGDCTGLLR 583 

Qy 540 539 

Db 584 AS LS EDRAGLVS VNLLVT S SVAAFVVGAVVS GFS VGWFVGLRERRELARRKDKEAI LAHG 643 

Qy 540 RGN PE WACAS GP — MS 553 

M M III: II: 

Db 644 AGEAVLSVSRLGERRAQGPGGRGGGGGGGAGVP PEAL LAP LMQNGWAKATLLQGGPHDLD 703 

Qy 554 RSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASSTVYNGSLLLI 613 

M I : I I I I I I I III : I : : I : I I ! I : 
Db 704 SGLLP TPEQTPLPQKRLPTPHPHPHALGPRAWDHGHPLLPASASS SLLLL 753 



RESULT 13 
US-09-077-940A-2 

; Sequence 2, Application US/09077940A 

; Patent No. 6576441 

; GENERAL INFORMATION: 

; APPLICANT: KIMURA, Toru et al. 

; TITLE OF INVENTION: NOVEL SEMAPHORIN Z AND GENE ENCODING THE SAME 
; FILE REFERENCE: 0020-4426P 

; CURRENT APPLICATION NUMBER: US/09/077,940A 

; CURRENT FILING DATE: 1998-06-05 

; NUMBER OF SEQ ID NOS: 20 

; SOFTWARE: PatentIn version 3.1 

; SEQ ID NO 2 

; LENGTH: 887 

; TYPE: PRT 

; ORGANISM: Rattus norvegicus 
US-09-077-940A-2 



Query Match 16.7%; Score 672.5; DB 4; Length 887; 

Best Local Similarity 27.0%; Pred. No. 6.4e-58; 



Matches 229; Conservative 132; Mismatches 291; Indels 195; Gaps 35; 

Qy 18 LFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQDFDTLLLSGDG 73 

I I I I II I II II II : I : : I I I 

Db 14 LFLLLLLLRVT HGLFP DEPPPLSVAPRDYLSHYPVFVGSGPGRLTP 59 

Qy 74 NT L YVGARE AI LALD I Q D P GVP RLKNMI PWPAS DRKKS ECA 114 

I I : : I I : : : : : : I I : : I : : I 

Db 60 AEGAEDLNIQRVLRVNRTLFIGDRDNLYQVEL-EPSTSTELRYQRKLTWRSNPSDIDVCR 118 

Qy 115 FKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKG 174 

II I : I I I : : I I : : : I : I I : I I : I I : II : : I 
Db 119 MKGK-QEGECRNFVKVLLLRDESTLFVCGSNAFNPICANYSMDTLQLLGDN ISGMA 173 

Qy 175 QSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDAS — 232 

: I : I I I : I : I I I I : : I : : I I : : : I : I I : I I : I : I I : 

Db 174 RC P YD P KH AN VAL F S D GML FT AT VT D FL AI D AVI YRSLGDRPTLRT VKHDSKWF 227 

Qy 233 FVAAI PSTQ WYFFFEETAS E FDFFERLHT S RVARVCKNDVGGE- KLLQKKWTT FL 287 

III: I I I I I I I I I : : I : : I I I I I I I II I I I I : : I : I : I I : I I 

Db 228 KEPYFWAVEWGSHVYFFFREIAMEFNYLEKVWSRVARVCKNDVGGSPRVLEKQWTSFL 287 

Qy 288 KAQ LLCTQPG — QLP FN VI RHAVL L PAD S P TAP H I YAVF T S Q WQ VG G T R S S AVC AF S L L D 345 

11:11:11 III:: II I I III:: I I I I I I : 

Db 288 KARLNC S VP GD S H F Y FNVLQ- AVT GWS L GGRPVI LAVFS T — P SN S I P GS AVCAFDMNQ 344 

Qy 346 IERVFKGKYKELNKETSRWTTYRGPE TNPRPGSCSV GPSSDKALTFMKD 394 

: I I : I : : : I III II Mill: I : I I : I 

Db 345 VAAVFEGRFREQKSPESIWTPV--PEDQVPRPRPGCCAAPGMQYNASNALPDEILNFVKT 402 

Qy 395 HFLMDEQWG TPLLVKSGVEY — TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWS 449 

I I I I I I : | : | : : : : I I : I I : | | : : I : I I : I : : I : I 

Db 403 HPLMDEAVPSLGHSPWIVRTLIRHQLTRVAVDVGAGPWGNQTIV-FLGSEVGTVLKFLVK 461 

Qy 450 GDSSAH LVEEIQLF-PD PEPVRNLQLAPTQGAVFVGFSGGVW 490 

: : I : | | : : | | : : : | : | | : | | 

Db 462 PNASVSGTTGPSIFLEEFETYRPDRCGRSSSAGEWGQRLLSLELDAASGGLLAAFPRCW 521 

Qy 491 RVPRANCSVYESCV-DCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMERGNPEWACAS 549 

I II I I : I | : : | : : : | | : | | | : : | | : : : | | : : 

Db 522 RVPVARCQLYSGCMKNCIGSQDPYCGWAPDG-SCIFLRPGTSATFEQDVSGASTSGLGDC 580 

Qy 550 GPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYW 591 

: I : I : : I I : I : : : I I 

Db 581 TGLLRAS L S DDRAGLVSVNLLVT S S VAAFVVGAVVS GFS VGWFVGLRERRELARRKDKEA 640 

Qy 592 — SH GPAAVPEASSTVYNGSLLLIVQDGVGGL 621 

: I I I I I I I : : I : I 
Db 641 ILAHGGSEAVLSVSRLGERRGTGTGGRGGAGGGPGGPPEA LLAPLMQNG 689 

Qy 622 YQCW ATENGFSYPVISYWVDSQDQTLALDPELAGIPREHVKVPLTRVSGGAALAAQQ 678 

I I : I : : | | I I : | : : : : | | I I 

Db 690 WTKAALLHGGPHDL DSGLLPTPEQTPLPQK — RLPTTHPH AHALGP 733 

Qy 679 SYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCETLRPGEKAPLSREQHLQS 738 

II III: II II: : I I I I I I : 
Db 734 RAWDH SHALLSASASTSLLLLAHTRAPEQPPV-PTESGPESR LCA 777 



Qy 739 PKECRTS 745 

1:111 
Db 778 PRSCRAS 784 



RESULT 14 
US-09-653-274-8 

Sequence 8, Application US/09653274 
Patent No. 6635742 
GENERAL INFORMATION: 
APPLICANT: Boyle, Bryan J 
APPLICANT: Yeung, George Y 
APPLICANT: Arterburn, Matthew C 
APPLICANT: Mize, Nancy K 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Drmanac, Radoje T 

TITLE OF INVENTION: Methods and Maaterials Relating to Semaphorin-Like 
TITLE OF INVENTION: Polypeptides and Polynucleotides 
FILE REFERENCE: HYS-23 

CURRENT APPLICATION NUMBER: US/09/653,274 
CURRENT FILING DATE: 2000-08-31 
PRIOR APPLICATION NUMBER: 09/491,404 
PRIOR FILING DATE: 2000-01-10 
NUMBER OF SEQ ID NOS: 13 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 8 
LENGTH: 1070 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-653-274-8 



Query Match 16.5%; Score 666; DB 4; Length 1070; 

Best Local Similarity 29.1%; Pred. No. 4e-57; 

Matches 206; Conservative 124; Mismatches 242; Indels 136; Gaps 35; 

Qy 63 DFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIP WPAS DRKKS ECAFKK 117 

II : I I III: I : : : : : : : I : : : I | | : : : | | | 

Db 42 DFQLMLKIRD — TLYIAGRDQVYTVNLNE — MPKTE-VIPNKKLTWRSRQQDRENCAMKG 96 

Qy 118 KSNETQCFNFI RVLVS YNVTHLYTCGTFAFS PACTFI ELQD- S YLLPI SEDKVMEGKGQS 176 

I :: : I I I I : I I I :: I I I I I : I I : I | : : : | : 

Db 97 K-HKDECHNFIKVFVPRNDEMVFVCGTNAFNPMCRYYRLSTLEY DGEEISGLARC 150 

Qy 177 P FDPAHKHTAVLVDGML YS GTMNN FLGS EP I LMRTLGSQPVLKTDNF- LRWLHHDAS FVA 235 

III : I : I I I I I I : : I I | : : : | : : | I : | : : | : : | : 

Db 151 P FDARQTNVAL FADGKL YSATVAD FLAS DAVI YRSMGDGSALRT I K YDS KWI - KEPHFLH 209 



Qy 236 AIPSTQWYFFFEETASEFDFFERLHTSRVARVCKNDVGG-EKLLQKKWTTFLKAQLLCT 294 

II I I II I I I I : : I I I I I : I I I I : I I :: : I : I I I : I I I I : I I : 

Db 210 AIEYGNYVYFFFREIAVEHNNLGKAVYSRVARICKNDMGGSQRVLEKHWTSFLKARLNCS 269 

Qy 295 QPGQ LP FN VI RHAV- L L PAD S P TAP H I YAVFT S QWQ VGGT RS S AVC AF S L L D I E RVFK 351 

II I = l = : :: : | : | | | : | : I I I I I I I : I I I : I I I 

Db 270 VPGDPFFYFDVLQSITDIIQING — IPTWGVFTT — QLNSIPGSAVCAFSMDDIEKVFK 325 



Qy 352 GKYKELNKETSRWTTYRGPE TNPRPGSCSVGPSS DKALT FMKDHFLM 398 

1 : : 1 1 III II Mill: : | : | : | : | | | | 

Db 326 GRFKEQKTPDSVWTAV--PEDKVPKPRPGCCAKHGLAEAYKTSIDFPDETLSFIKSHPLM 383 

Qy 399 DEQV VGTPLLVKSGVEY--TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAVVSGD-- 451 

1 1 1 1 : 1 1 I : : I : : | : : | : : : | : | : | : 

Db 384 DSAVPPIADEPWFTKTRVRYRLTAISVDHSAG-PYQNYTVIFVGSEAGMVLKVLAKTSPF 442 

Qy 452 -- SSAHLVEEIQLF PDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY 500 

: : 1 : 1 1 1 : : : : | : | | | | : : | | | : | : | : | | 

Db 443 SLNDSVLLEEIEAYNHAKCSAENEEDKKVISLQLDKDHHALYVAFSSCIIRIPLSRCERY 502 

Qy 501 ESC-VDCVLARDPHCAWDPESRTCCLLSAPNL NSWKQDMERGNPE 544 

II 1 : : 1 1 1 : 1 1 1 : 1 I : ::| 1 1 I 1 

Db 503 GSCKKSCIASRDPYCGW--LSQGSCGRVTPGMLLLTEDFFAFHNHSAEGYEQDTEFGN-- 558 

Qy 545 WACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASST 604 

1 1 : 1 : : 1 1 : 1 : : : : I I : | | : 

Db 559 -TAHLGDCHEILPTSTTPDY--KI FGGPTSDMEVS SS SVTTMAS IPEITPK 606 

Qy 605 VYN GSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIP 657 

1 : 1 -III :: | | 

Db 607 VI DTWRPKLTS SRKFWQDD PNTSDFT DP-LSGIP 640 

Qy 658 RE HVKVPLTRVSGGAALAAQQSYWPHF-VTVTVLFALVLSGALIILVA 704 

1 II : : 1 1 : 1 : 1 1 : 1 1 1 1 1 

Db 641 KGVRWEVQSG ESNQMVHMNVLITCVFAAFVLGAFIAGVA 679 





RESULT 15 
US-09-653-274-4 

Sequence 4, Application US/09653274 
Patent No. 6635742 
GENERAL INFORMATION: 
APPLICANT: Boyle, Bryan J 
APPLICANT: Yeung, George Y 
APPLICANT: Arterburn, Matthew C 
APPLICANT: Mize, Nancy K 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Drmanac, Radoje T 

TITLE OF INVENTION: Methods and Maaterials Relating to Semaphorin-Like 
TITLE OF INVENTION: Polypeptides and Polynucleotides 
FILE REFERENCE: HYS-23 

CURRENT APPLICATION NUMBER: US/09/653,274 
CURRENT FILING DATE: 2000-08-31 
PRIOR APPLICATION NUMBER: 09/491,404 
PRIOR FILING DATE: 2000-01-10 
NUMBER OF SEQ ID NOS: 13 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 4 
LENGTH: 1086 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-653-274-4 



Query Match 16.5%; Score 666; DB 4; Length 1086; 



Best Local Similarity 29.1%; Pred. No. 4.1e-57; 

Matches 206; Conservative 124; Mismatches 242; Indels 136; Gaps 35; 

Qy 63 DFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIP WPAS DRKKS ECAFKK 117 

II : I I III: I : : : : : : : I : : : I I I : : : III 

Db 58 DFQLMLKIRD — TLYIAGRDQVYTVNLNE — MPKTE-VIPNKKLTWRSRQQDRENCAMKG 112 

Qy 118 KSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQD-SYLLPISEDKVMEGKGQS 176 

I : : : I I I I : I I I : : I I I I I : I I : I I : : : | : 

Db 113 K- H KDE CHN FI KVFVP RN D EMVFVCGTNAFN PMCRY YRL S T LE Y DGEEISGLARC 166 

Qy 177 PFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNF-LRWLHHDASFVA 235 

III : I : I I I I I I : : I I I : : : I : : I I : I : : I : : I : 

Db 167 P FDARQTNVALFADGKL YSAT VAD FLAS DAVI YRSMGDGSALRT I KYDS KWI -KEPHFLH 225 

Qy 236 AIPSTQWYFFFEETASEFDFFERLHTSRVARVCKNDVGG-EKLLQKKWTTFLKAQLLCT 294 

II I I I I I I I I : : I I I I I : I I I I : I I : : : I : I I I : I I I I : I I : 

Db 226 AI EYGNYVYFFFREI AVEHNNLGKAVYS RVARI CKNDMGGSQRVLEKHWTS FLKARLNCS 285 

Qy 295 QPGQ — L P FNVI RH AV- L L PAD S P TAP H I YAVFT S QWQVG GT RS S AVCAF S L LD I E RVFK 351 

II I : I : : :: : I : III: I : I I I I I I I : I I I : I II 

Db 286 VP GD P FFYFDVLQ S I TDI I QING- - 1 PT WGVFTT — QLNS I P GSAVCAFSMDDI EKVFK 341 

Qy 352 GKYKELNKETSRWTTYRGPE TNPRPGSCSVGPSS DKALT FMKDHFLM 398 

I : : I I III II Mill: : I : I : I : I I I I 

Db 342 GRFKEQKTPDSVWTAV — PEDKVPKPRPGCCAKHGLAEAYKTSIDFPDETLSFIKSHPLM 399 

Qy 399 DEQV VGTPLLVKSGVEY--TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSGD— 451 

I I I 1:11 I : : I : : | : : I : : : | : I : I : 

Db 400 DSAVPPIADEPWFTKTRVRYRLTAISVDHSAG-PYQNYTVI FVGSEAGMVLKVLAKTSPF 458 

Qy 452 — SSAHLVEEIQLF PDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY 500 

: : I : I I I : : : : I : I I I I : : I I I : I : I : I I 

Db 459 SLNDSVLLEEIEAYNHAJKCSAENEEDKKVI SLQLDKDHJiALYVAFSSCI IRI PLSRCERY 518 

Qy 501 ESC-VDCVLARDPHCAWDPESRTCCLLSAPNL NSWKQDMERGNPE 544 

II I : : I I I : I I I : I I : : : I I I II 

Db 519 GSCKKSCIASRDPYCGW — LSQGSCGRVTPGMLLLTEDFFAFHNHSAEGYEQDTEFGN — 574 

Qy 545 WACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASST 604 

I I : I : : I I : I : : : : I I : I I : 

Db 575 -TAHLGDCHEILPTSTTPDY — KI FGGPTSDMEVS SS SVTTMAS IPEITPK 622 

Qy 605 VYN GSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIP 657 

I : I : I I I : : I I I I I : I I I 
Db 623 VI DTWRPKLT S S RKFWQDD PNTSDFT DP-LSGIP 656 

Qy 658 RE HVKVPLTRVSGGAALAAQQSYWPHF-VTVTVLFALVLSGALIILVA 704 

I II : : I I : I : I I : I I I I I 

Db 657 KGVRWEVQSG ESNQMVHMNVLITCVFAAFVLGAFIAGVA 695 



Search completed: May 5, 2004, 17:31:25 
Job time : 26 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on:          May 5, 2004, 17:25:56 ; Search time 21 Seconds 
                                         (without alignments) 
                                         3485.798 Million cell updates/sec 

Title:           US-10-015-391A-277 
Perfect score:   4031 
Sequence:        1 MALPALGLDPWSLLGLFLFQ CRTSASDVDADNNCLGTEVA 761 

Scoring table:   BLOSUM62 
                 Gapop 10.0 , Gapext 0.5 

Searched:        283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters:    283366 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
                 Maximum Match 100% 
                 Listing first 45 summaries 

Database:        PIR_78:* 
                 pir1:* 
                 pir2:* 
                 pir3:* 
                 pir4:* 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
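
The header figures can be cross-checked against one another. The sketch below is an illustration only and is not part of the search output. It assumes that one "cell update" is one (query residue, database residue) pair, and that Query Match is the hit score expressed as a percentage of the perfect score; both assumptions reproduce the values printed in this listing. The function names are hypothetical.

```python
# Values copied from the search header.
QUERY_LENGTH = 761          # residues in the query, US-10-015-391A-277
DB_RESIDUES = 96_191_526    # "Searched: 283366 seqs, 96191526 residues"
SEARCH_SECONDS = 21         # "Search time 21 Seconds"
PERFECT_SCORE = 4031        # score of the query aligned against itself


def million_cell_updates_per_sec(query_len, db_residues, seconds):
    """Assumed definition: one cell per (query residue, database residue) pair."""
    return query_len * db_residues / seconds / 1e6


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Assumed definition: hit score as a percentage of the perfect score."""
    return 100.0 * score / perfect_score


rate = million_cell_updates_per_sec(QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS)
print(f"{rate:.3f} Million cell updates/sec")   # header reports 3485.798
print(f"{query_match_percent(3280.5):.1f}%")    # summary row 1 reports 81.4
```

The same ratio also accounts for the patent-database hits earlier in the listing, for example a score of 673 against the perfect score of 4031 gives 16.7%.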

SUMMARIES 



Result              Query 
   No.     Score    Match  Length  DB  ID       Description 

     1    3280.5     81.4     760   2  I48745   semaphorin B - mou 
     2    1198.5     29.7     782   2  I48746   semaphorin C - mou 
     3    1079.5     26.8     834   2  S66498   M-sema F protein p 
     4     838.5     20.8     753   2  G02173   semaphorin III fam 
     5     838       20.8     772   2  A49069   collapsin - chicke 
     6     836       20.7     772   2  I48747   semaphorin D - mou 
     7     825.5     20.5     771   2  D49423   semaphorin III pre 
     8     820       20.3     749   2  G01856   semaphorin V - hum 
     9     802.5     19.9     751   2  I48748   semaphorin E - mou 
    10     784.5     19.5     748   2  I48744   semaphorin A - mou 
    11     766       19.0     666   2  I58169   semaphorin III - m 
    12     677.5     16.8    1074   2  JC5928   semaphorin F precu 
    13     657.5     16.3     730   2  JH0798   fasciclin IV precu 
    14     621.5     15.4     712   2  T27165   hypothetical prote 
    15     609.5     15.1     711   2  A49423   semaphorin I precu 
    16     605.5     15.0     656   2  B49423   semaphorin I - fru 
    17     600.5     14.9     724   2  C49423   semaphorin II prec 
    18     442.5     11.0     653   2  T03102   semaphorin homolog 
    19     350        8.7     676   2  T33853   hypothetical prote 
    20     281.5      7.0    1905   2  I51553   Plexin - African c 
    21     277        6.9    1872   2  JC4976   plexin 3 precursor 
    22     262        6.5    1894   2  JC4980   plexin 1 precursor 
    23     238        5.9    1884   2  JC4975   plexin 2 precursor 
    24     228        5.7    2051   2  T13164   plexin B - fruit f 
    25     225.5      5.6    1945   2  T13937   plexin A - fruit f 
    26     157        3.9    1806   2  T23298   hypothetical prote 
    27     151.5      3.8    1568   2  T09074   semaphorin recepto 
    28     146.5      3.6     403   2  E42521   A39R protein - vac 
    29     145        3.6     441   2  S29921   hypothetical prote 
    30     139.5      3.5     866   2  T06454   probable lipoxygen 
    31     132.5      3.3     317   2  T46426   hypothetical prote 
    32     132.5      3.3    1375   1  JC5148   hepatocyte growth 
    33     128.5      3.2     446   2  AI1253   glutathione reduct 
    34     127.5      3.2    1369   1  JC4860   protein-tyrosine k 
    35     127        3.2    1425   2  T30811   hepatocyte growth 
    36     121        3.0    1374   2  T30809   plasminogen relate 
    37     120        3.0     868   2  T06827   lipoxygenase (EC 1 
    38     119        3.0    1400   1  I38185   protein-tyrosine k 
    39     116        2.9    2126   2  H70621   probable polyketid 
    40     114        2.8     295   2  JQ1775   SalL9R protein - v 
    41     114        2.8     861   1  S01142   lipoxygenase (EC 1 
    42     114        2.8     862   2  T07775   lipoxygenase (EC 1 
    43     114        2.8    3573   2  S23070   erythronolide synt 
    44     112.5      2.8     862   2  S57964   lipoxygenase (EC 1 
    45     111.5      2.8    1378   1  I48751   protein-tyrosine k 



ALIGNMENTS 



RESULT 1 
I48745 

semaphorin B - mouse 

C;Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 24-Nov-1999 
C;Accession: I48745 

R;Puschel, A.W.; Adams, R.H.; Betz, H. 
Neuron 14, 941-948, 1995 

A;Title: Murine semaphorin D/collapsin is a member of a diverse gene family and 
creates domains inhibitory for axonal extension. 
A;Reference number: I48744; MUID:95267431; PMID:7748561 
A;Accession: I48745 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-760 <RES> 

A;Cross-references: EMBL:X85991; NID:g854325; PIDN:CAA59983.1; PID:g854326 
C;Genetics: 
A;Gene: semB 

C;Superfamily: semaphorin 



Query Match 81.4%; Score 3280.5; DB 2; Length 760; 

Best Local Similarity 82.3%; Pred. No. 6.4e-252; 

Matches 628; Conservative 46; Mismatches 84; Indels 5; Gaps 3; 
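
The Best Local Similarity figure appears to follow from the four alignment counts printed on the line above. The sketch below is an inference from the numbers in this listing, not a documented GenCore formula: it assumes the percentage is the number of identical positions divided by the total number of alignment columns (matches + conservative + mismatches + indels). The function name is hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Identical positions as a percentage of all alignment columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# RESULT 1 of the PIR search: Matches 628; Conservative 46; Mismatches 84; Indels 5
print(f"{best_local_similarity(628, 46, 84, 5):.1f}%")      # listing reports 82.3%

# RESULT 12 of the patent search: Matches 210; Conservative 117; Mismatches 225; Indels 228
print(f"{best_local_similarity(210, 117, 225, 228):.1f}%")  # listing reports 26.9%
```

Under this reading, conservative substitutions count toward the alignment length but not toward the similarity percentage.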

Qy 1 MALPALGLDPWSLLGLFLFQLLQLLLPT--TTAGGGGQGPMPRVRYYAGDERRALSFFHQ 58 
1 ' 1 I : I I I I I I : I I I I |||: : I I I I I I I I I I : | : | | | I II II I 
Db 1 MALPSLGQDSWSLLRVFFFQL--FLLPSLPPASGTGGQGPMPRVKYHAGDGHRALSFFQQ 58 

Qy 59 KGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKK 118 
1 1 I = I I I I I I I I I I I I I I I I I I I : I I I : I I : I I : I I I I I I | | I I I I : I I I : I I I I I I I 
Db 59 KGLRDFDTLLLSDDGNTLYVGARETVLALNIQNPGIPRLKNMIPWPASERKKTECAFKKK 118 

Qy 119 SNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPF 178 
I I I I I I I I I I I I I I I | | | | I I I I I I I I I I I I I I I I I I | | MM II II : || | M I 
Db 119 

Qy 179 DPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIP 238 
I I I I I I I I I I I I I I I I I I I I I I II I I II I MUM II II II M I I II II I 
Db 179 TLFTSTQAVLVDGMLYSGTMNNFLGSEPILMRTLGSHPVLKTDIFLRWLHADASFVAAIP 238 

Qy 239 STQVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ 298 
I I I I I I I I I I II I I II II I I I : II II : II II M II II II II I I II II I M M I MM 
Db 239 STQVVYFFFEETASEFDFFEELYISRVAQVCKNDVGGEKLLQKKWTTFLKAQLLCAQPGQ 298 

Qy 299 LPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELN 358 
I I I I : I I I I I I I I I I I I : II I II II I II I || I I II I II II II | || | M II I I M I I 
Db 299 LPFNIIRHAVLLPADSPSVSRIYAVFTSQWQVGGTRSSAVCAFSLTDIERVFKGKYKELN 358 

Qy 359 KETSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRL 418 
I I M I II II I : II I I II I : I II || | || || II I II M I I II II II I I 
Db 359 

Qy 419 AVETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQ 478 
I I ■ - I = I 1 I I IIMIMIhll MUM I I I f : I I I I I I I II MIMIMM I 
Db 419 AVESARGLDGSSHVVMYLGTSTGPLHKAVVPQDSSAYLVEEIQLSPDSEPVRNLQLAPAQ 478 

Qy 479 GAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDM 538 
I I I I I I I M : I I I I I I I I || | | M II M II I M II II I Ml : Mill 
Db 479 

Qy 539 ERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAV 598 
I I I I I I I I MMII I II MMMII I II I II II I II I II II II : II I I I : 
Db 538 ERGNPEWVCTRGPMARSPRRQSPPQLIKEVLTVPNSILELRCPHLSALASYHWSHGRAKI 597 

Qy 599 PEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPR 658 
I I hi I I I M I I I: I II I II I I II II M I : I I II : II I II M II II I II I I I Ml I 
Db 598 SEASATVYNGSLLLLPQDGVGGLYQCVATENGYSYPVVSYWVDSQDQPLALDPELAGVPR 657 

Qy 659 EHVKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQG 718 
I MUNI I I I ::| MM III I |: MM MM I | : I : | | I I MIMIMM 
Db 658 ERVQVPLTRVGGGASMAAQRSYWPHFLIVTVLLAIVLLGVLTLLLASPLGALRARGKVQG 717 

Qy 719 CETLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761 
Illlhllll I: I I M I t I I I I M M Ml 
Db 718 



RESULT 2 
I48746 

semaphorin C - mouse (fragment) 

C;Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 03-Nov-2000 
C;Accession: I48746 

R;Puschel, A.W.; Adams, R.H.; Betz, H. 
Neuron 14, 941-948, 1995 

A;Title: Murine semaphorin D/collapsin is a member of a diverse gene family and 
creates domains inhibitory for axonal extension. 
A;Reference number: I48744; MUID:95267431; PMID:7748561 
A;Accession: I48746 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-782 <RES> 

A;Cross-references: EMBL:X85992; NID:g854327; PIDN:CAA59984.1; PID:g854328 
C;Genetics: 
A;Gene: semC 

C;Superfamily: semaphorin 

Query Match 29.7%; Score 1198.5; DB 2; Length 782; 

Best Local Similarity 39.9%; Pred. No. 1.1e-86; 

Matches 275; Conservative 115; Mismatches 244; Indels 55; Gaps 22; 

Qy 49 ERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVP--RLKNMIPWPAS 106 

II: I : : : : I I II I I I I I I I I I I I : I I : : I : : : I I 

Db 1 EERLI RKFEAENI SNYTALLLSQDGKTLYVGAREALFALNSNLS FLPGGEYQELL-WSAD 59 

Qy 107 DRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISE 166 

: I : I : I I I : I I : I : : I : I : I I I i I I I I II I : I : I 
Db 60 ADRKQQCSFKGKDPKRDCQNYIKILLPLNSSHLLTCGTAAFSPLCAYIHIASFTLAQDEA 119 

Qy 167 DKVM — EGKGQ S P FDPAHKHTAVLVDGML YS GTMNN FLGS E P I LMRTLGS QPVLKTDN FL 224 

I : : I I I I I I I I | I : : | | | | | : | | : : : | | : : | : | : I : I I I : : I 
Db 120 GNVILEDGKGHCPFDPNFKSTALWDGELYTGTVSSFQGNDPAISRSQSSRPT-KTESSL 178 

Qy 225 RWLHHDASFVAAIPSTQ WY F F FEET AS E FD F FE RLHT S RVARVC KN D VG 274 

II I : I I I : I : : I I I I I I I I : I I I I I I I I I I I I I 

Db 179 NWL-QDPAFVASATSPESLGSPIGDDDKIYFFFSETGQEFEFFENTIVSRVARVCKGDEG 237 

Qy 275 GEKLLQKKWTTFLKAQLLCTQPGQ-LPFNVI RHAVLL PAD S P TAP H I YAVFT S QWQ V 330 

I I : : I I : : I I : I I I I I I I I : : I MM:: | II | I I I I I I 

Db 238 GERVLQQRWTSFLKAQLLCSRPDDGFPFNVLQDVFTLNPNPQDWRKTLSI-GVFTSQWHR 296 

Qy 331 GGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNPRPGSCSVGPS 384 

II I I : I I : : I : : : I I I I : : I : I I : I I I I I I : I : 

Db 297 GTTEGSAICVFTMNDVQKAFDGLYKKVNRETQQWYTETHQVPTPRPGACITNSARERKIN 356 

Qy 385 SDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAVETAQGLDGHS-HLVMYLGT 438 

hlhlllllllll II:: I I : I I II I I : I : : I II 

Db 357 SSLQLPDRVLNFLKDHFLMDGQVRSRLLLLQPRARYQRVAVHRVPGL — HSTYDVLFLGT 414 

Qy 439 TTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCS 498 

I I I I I I I I : : I I : I : I I : I I : I I I I : : I I : I I I I I I 

Db 415 GDGRLHKAVTL-SSRVHIIEELQIFPQGQPVQNLLLDSHGGLLYASSHSGWQVPVANCS 473 



Qy 499 VYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNS--WKQDMERGNPEWACASGPMSRSL 556 



: I : I I I : I I I I I : I I I : I hi I I I I : I : : I : 

Db 474 LYPTCGDCLLARDPYCAWTGSACRLASLYQPDLASRPWTQDIEGASVKELCKNSSYKARF 533 

Qy 557 RPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASS— TVYNGSLLLI- 613 

: I I : I Ih: I I I I I I I : II II : : I : I II I : 
Db 534 LVPGKP — CKQVQIQPNTVNTLACPLLSNLATRLWVHNGAPVNASASCRVLPTGDLLLVG 591 

Qy 614 VQDGVGGLYQCWATENGFS YPVI S YWVDSQDQ-TLALDPELAGI PREHVKVPLTRVS 669 

I I : I : : I I I : I I I III : : : : : II I : : I I I 
Db 592 SQQGL-GVFQCWSIEEGFQQLVASYCPEVMEEGVMDQKNQRDGTP VIINTSRVSAPA 647 

Qy 670 -GGAALAAQQSYWPHFVTVTVL— FALVL 695 

I : I : I I I I : : I I I : I I 

Db 648 GGRDSWGADKSYWNEFLVMCTLFVFAMVL 676 



RESULT 3 
S66498 

M-sema F protein precursor - mouse 
C;Species: Mus musculus (house mouse) 

C;Date: 28-Oct-1996 #sequence_revision 13-Mar-1997 #text_change 24-Nov-1999 
C;Accession: S66498 

R;Inagaki, S.; Furuyama, T.; Iwahashi, Y. 
FEBS Lett. 370, 269-272, 1995 

A;Title: Identification of a member of mouse semaphorin family. 

A;Reference number: S66498; MUID:95385809; PMID:7656991 

A;Accession: S66498 

A;Status: preliminary 

A;Molecule type: mRNA 

A;Residues: 1-834 <INA> 

A;Cross-references: EMBL:S79463; NID:g1110598; PIDN:AAB35184.1; PID:g1110599 
C;Superfamily: semaphorin 

F;1-21/Domain: signal sequence #status predicted <SIG> 
F;22-834/Product: M-sema F protein #status predicted <MAT> 

Query Match 26.8%; Score 1079.5; DB 2; Length 834; 

Best Local Similarity 35.4%; Pred. No. 3.5e-77; 

Matches 271; Conservative 115; Mismatches 242; Indels 137; Gaps 25; 

Qy 39 MPRVRYYAGDERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLK 98 

: I I : I : : I I I : I I I I I I : I I I I I I I I : I :: I : 

Db 28 VPRKTVSSGELVTWRRFSQTGIQDFLTLTLTEHSGLLYVGAREALFAFSVE ALELQ 8 4 

Qy 99 NMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQD 158 

III I I I I I I I I : I : I I I I I I I I I : I I I I I I : I I I I I : I : 

Db 85 GAI SWEAPAEKKIECTQKGKSNQTECFNFIRFLQPYNSSHLYVCGTYAFQPKCTYINM — 142 

Qy 159 SYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVL 218 

: : : I I I : I : | I I I I : I I I I I I I I : I I I II : I I : : : I : I : : 
Db 143 -LTFTLDRAEFEDGKGKCPYDPAKGHTGLLVDGELYSATLNNFLGTEPVILRYMGTHHSI 201 

Qy 219 KT DN FLRWLHHDAS FV — AAI P S T QWYFFFEETASEFDFFERLHTSRVARV 268 

II: II:: II I : I : : I I I I I I I : I : : i I I I I 

Db 202 KTEYLAFWL-NEPHFVGSAFVPESVGSFTGDDDKIYFFFSERAVEYDCYSEQWARVARV 260 

Qy 269 CKNDVGGEKLLQKKWTTFLKAQLLCTQPG-QLPFNVIRHAVLLPADSPTAPHIYAVFTSQ 327 

I I I : I I : I I I I I I I I I II : I : I : I : : I I : : 1 I : I I : : 



Db 261 CKGDMGGARTLQKKWTTFLKARLVCSAPDWKVYFNQLKAVHTLRGASWHNTTFFGVFQAR 320 

Qy 328 WQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNPRPGSC SV 381 

I I I I I I : I |::||:| Ml ::: :| | | :|||MI 
Db 321 W GDMDLSAVCEYQLEQIQQVFEGPYKEYSEQAQKWARYTDPVPSPRPGSCINNWHRDN 378 

QY 382 GPSS DKALTFMKDHFLMDEQV VGTPLLVKSGVEYTRLAVETAQGLDGHSHLV 433 

1 : I I I I : I I I I : : I | : I I I I I I : | : : | | | | : : | 

Db 379 GYTSSLELPDNTLNFIKKHPLMEDQVKPRLGRPLLVKKNTNFTHVVADRVPGLDGATYTV 438 

QY 434 MYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVP 4 93 

: : : I I I I I I I I I : I i I : I : I I I I I : | | : : : : | | s : : 

Db 439 LFIGTGDGWLLKAVSLG-PWIHMVEELQVF-DQEPVESLVLSQSKKVLFAGSRSQLVQLS 4 96 

Qy 494 RANCSVYESCVDCVLARDPHCAWDPESRTCCLLSA PNLNSWKQDMERGN 542 

I • I : I I I I II I I I I I : I I I : : I : : | | : : I : | 

Db 497 ^ DCT KYRFCVDCVLARDPYCAWNVNTSRCVATTSGRSGSFLVQHVANLDTSKMCNQYG- 555 

Qy 543 PEWACASGPMSRSLRPQSRPQIIKEVLAVPNSI LELPCPHLSA-LASYYWSHG 594 

nh Rtc 11:1 ::| :| I I I I I I I : I I : I : I 

Db 556 IKKVRSIPKNITWSGTDLVLPC-HLSSNLAHAHWTFG 592 

Qy 595 PAAVP — EASSTVYNGSLLLIV QDGVGGLYQCWATENGFS YPVI S YW VDSQDQ 645 

:| : I :|: | :| | I |:|:: | | | | | 

Db 593 SQDLPAEQPGSFLYDTGLQALVVMAAQSRHSGPYRCYSEEQGTRLAAESYLVAVVAGSSV 652 

Qy 646 T L AL D P E LAG I P REH VKVP L T RVS G GAAL AAQQ S YW P H FVT VT VL FAL VL S G A LII 701 

N : | | : I I : I I I | : : 

Db 653 TL EARAPLENLG LWLAWALGAVCLVLLL 682 

Qy 702 LVAS PLRALRARGKVQGCETLRPGEKA PLSREQHLQSP 739 

I I I I I I I I I I I || : || 

Db 683 LVLSLRRRLR EELEKGAKASERTLVYPLELPKEPASP 719 



RESULT 4 
G02173 

semaphorin III family homolog - human 
C;Species: Homo sapiens (man) 

C;Date: 21-Dec-1996 #sequence_revision 06-Jun-1997 #text_change 24-Sep-1999 
C;Accession: G02173 
R;Naylor, S. 

submitted to the EMBL Data Library, October 1995 
A;Reference number: G09275 
A;Accession: G02173 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-753 <NAY> 

A;Cross-references: EMBL:U38276; NID:g1061350; PIDN:AAB18276.1; PID:g1061351 
C;Superfamily: semaphorin 

Query Match 20.8%; Score 838.5; DB 2; Length 753; 

Best Local Similarity 31.5%; Pred. No. 4e-58; 

Matches 230; Conservative 114; Mismatches 279; Indels 107; Gaps 30; 
Qy 26 LPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQK GLQDFDTLLLSGDG 73 



Db 26 LPAT PRVR- LSFKELKATGTAHFFNFLLNTTDYRILLKDEDH 66 



Qy 74 NTLYVGAREAI LALDI QDPGVPRLKNMI PWPASDRKKS ECAFKKKSNETQCFNFI RVLVS 133 

: : I I I : : : : I : I I : I : I : I I II :: II I : I I I : I : : 
Db 67 DRMYVGSKDYVLSLDLHD — INREPLIIHWAASPQRIEECVLSGKDYNGECGNFVRLIQP 124 

Qy 134 YNVTHLYTCGTFAFSPACTFI ELQDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLV 189 

: I I I II I I I I : : I I I : : III: : : : : I I I : I : I I : I : 

Db 125 WNRTHLYVCGTGAYNPMCTYVNRGRRAQD-YIFYLEPERLESGKGKCPYDPKLDTASALI 183 

Qy 190 DGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNF-LRWLHHDASFVAA — IPST QV 242 

: I I : I : I : I : : : I I I I I : : I I : I I I : I I I : I I I : 
Db 184 NEELYAGVYIDFMGTDAAIFRTLGKQTAMRTDQYNSRWL-NDPSFIHAELIPDSAENDDK 242 

Qy 243 VYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG Q 298 

: I I I I I : : I :: : | : | : | | I I I I I I : I I I I I : I : I : I I : 

Db 243 LYFFFRERSAEAPQSPAVY-ARIGRICLNDDGGHCCLVNKWSTFLKARLVCSVPGEDGIE 301 

Qy 299 L P FN VI RH AVL L PAD S P TAP H I YAVFT S Q WQ VGGT R S S AVC AF S L L D I E RVFKG K YK E LN 358 

I : : : : I I I I I I I I I I I I I I : I : I I Ml: 

Db 302 THFDELQDVFVQQTQDVRN PVI YAVFT S S GS V — FRGSAVCVYSMADI RMVFNGP FAHKE 359 

Qy 359 KETSRWTTYRGPETNPRPGSCSVG PS S DKALT FMKDH FLMDEQ W GT 4 05 

: I : I I I II : I I II I : : I I : I I I : I 

Db 360 GPNYQWMPFSGKMPYPRPGTCPGGTFTPSMKSTKDYPDEVINFMRSHPLMYQAVYPLQRR 419 

Qy 406 PLLVKSGVEY — TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAW SGDSSAHLVEE 459 

I I : I : : I I I : I I : II : I : : I I I I : : I : I : z : I I 

Db 420 PLWRTGAPYRLTTIAVDQVDSADGR-YEVLFLGTDRGTVQKVIVLPKDDQEMEELMLEE 478 

Qy 460 IQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDP 518 

:: : I I I I I : : : : : : : I : I I : I I : I I I I I I I I : I I I I 
Db 479 VEVFKDPAPVKTMTISSKRQQLYVASAVGVTHLSLHRCQAYGAACADCCLARDPYCAWD- 537 

Qy 519 ESRTCCLLSAPN-LNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEV-LAVPNSIL 576 

: I : I : I : I I : I i I I I : : : I II 

Db 538 - GQAC S RYTAS S KRRS RRQ DVRHGN P I RQC RG FN S NAN KNAVE S VQ YGVAG S AA 590 

Qy 577 ELPCPHLSALASYYW — SHGPA AVP EAS S T VYN G S L L L I VQ D GVGG L YQ CWAT E 628 

II I I : I I : I I I : I I I I I I I I 

Db 591 FLECQPRSPQATVKWLFQRDPGDRRREIRAEDRFLRTEQGLLLRALQLSDRGLYSCTATE 650 

Qy 629 NGFSYPV — ISYWVDSQDQT-LALDPEL — AG I P REH VKVP L T RVS GGAALAAQ 677 

I I : I : 1:1 I I I I : I I I I I I 

Db 651 NNFKHWTRVQLHVLGRDAVHAALFPPLSMSAPPPPGAGPPTPPYQELAQLLAQPEVGLI 710 

Qy 678 QSYWPH 683 

I I I I 

Db 711 HQYCQGYWRH 720 

RESULT 5 
A49069 

collapsin - chicken 

C;Species: Gallus gallus (chicken) 

C;Date: 07-Apr-1994 #sequence_revision 07-Apr-1994 #text_change 24-Sep-1999 
C;Accession: A49069 

R;Luo, Y.; Raible, D.; Raper, J.A. 
Cell 75, 217-227, 1993 

A;Title: Collapsin: a protein in brain that induces the collapse and paralysis 
of neuronal growth cones. 

A;Reference number: A49069; MUID:94006554; PMID:8402908 
A;Accession: A49069 

A;Status: preliminary; not compared with conceptual translation 
A;Molecule type: mRNA 
A;Residues: 1-772 <LUO> 

A;Cross-references: GB:U02528; NID:g410078; PIDN:AAC59638.1; PID:g410079 
C;Superfamily: semaphorin 

Query Match 20.8%; Score 838; DB 2; Length 772; 

Best Local Similarity 31.6%; Pred. No. 4.6e-58; 

Matches 203; Conservative 120; Mismatches 256; Indels 64; Gaps 19; 

QY 39 MPRVRYYAGDERRALSFFHQKGL QDFDTLLLSGDGNTLYVGAREAILALDIQDPGVP 95 

• i I : : : : : : II : I I I : : I I I I I : : | : : : : 
Db 29 VPRLKLSYKEMLESNNIVNFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFNLW 84 

QY 96 RLK— NMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTF 153 

: I INI : : I I : I : I I I I : I I : II I I I I I I I I I I I I : 

Db 85 -IKEYQKIVWPVSHSRRDECKWAGKDILRECANFIKVLKTYNQTHLYACGTGAFHPMCTY 143 

QY 154 IEL QD S YL L P I S EDKVME GKGQ S P FD PAHKHTAVLVDGML YS GTMNN FLG S E P I LMR 210 

I I : : = : : I : I : I | : | | :: I I I I I II I I : I : I : : I 

Db 144 IEVGSHPEDNIFRMEDSHFENGRGKSPYDPKLLTASLLVDGELYSGTAADFMGRDFAIFR 2 03 

Qy 211 TLGSQPVLKTDNF-LRWLHHDASFVAA IPSTQWYFFFEETASEFDFFERLHT 262 

Ml : : I : I I I : I I : : I I : I I I I I I : : : 

Db 204 TLGHHHPIRTEQHDSRWL-NDPRFISAHLIPESDNPEDDKIYFFFRENAIDGEHTGKATH 262 

QY 263 SRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG Q L P FN VI RH AVL L P AD S P TAP 318 

: I : : : I I I I I I : I I I I I I I I I : I : I : I I | : : : | : : | | 

Db 263 ARIGQICKNDFGGHRSLVNKWTTFLKT^LICSVPGPNGIDTHFDELQDVFLMNSKDPKNP 322 

Qy 319 HIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNPRPGS 378 

•Mil: : : I I I I : I : I : I I I I I : : | | : | | | | j : 

Db 323 IVYGVFTTSSNI — FKGSAVCMYSMTDVRRVFLGPYAHRDGPNYQWVPYQGRVPYPRPGT 380 

Qy 379 C S VGP S S DKALT FMKDH FLMDEQW GTPLLVKSGVEY — TRLAVETAQ 424 

I I I : : I I : I I I I : : : I : | : | | : : | : 

Db 381 CPSKTFGGFDSTKDLPDEVITFARSHPAMYNPVFPINSRPIMIKTDVDYQFTQIWDRVD 440 

QY 425 GLDGHSHLVMYLGTTTGSLHKAWSGDSSAH LVEEIQLFPDPEPVRNLQLAPTQG 479 

M : I I : : I I I : : I I : I I : I | : : | : | : : : : : | 

Db 441 AEDG-QYDVMFI GTDI GTVLKWS I PKETWHELEEVLLEEMTVFREPTVI SAMKI STKQQ 4 99 

Qy 480 AVFVGFSGGWRVPRANCSVY-ESCVDCVIARDPHCAWDPESRTCCLLSAPNL — NSWKQ 536 

: : : I : I I : : I I I I :: I : I I II I I : II I I I I I : : | 

Db 500 QLYI GS AT GVSQLPLHRCDVYGKACAECCLARDPYCAWDGSS CSRYFPTAKRRTRRQ 556 

QY 537 DMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHG — 594 

I = I • I h : I : I : I I I II I I I I 

Db 557 DIRNGDPLTHCSD— LQHHDNPSGQTLEEKIIYGVENSSTFLECSPKSQRAIVYWQFQKQ 614 

Qy 595 -PAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGF 631 



I * III • I I • I I I I • I I 

Db 615 NDDHKVEIKVDDRMIRTEQGLLLRSLQRRDSGIYFCHAVEHGF 657 



RESULT 6 
I48747 

semaphorin D - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 24-Sep-1999 
C; Accession: I48747 

R;Puschel, A.W.; Adams, R.H.; Betz, H. 
Neuron 14, 941-948, 1995 

A; Title: Murine semaphorin D/collapsin is a member of a diverse gene family and 
creates domains inhibitory for axonal extension. 
A;Reference number: I48744; MUID:95267431; PMID:7748561 
A; Accession: I48747 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-772 <RES> 

A;Cross-references: EMBL:X85993; NID:g854329; PIDN:CAA59985.1; PID:g854330 
C; Genetics: 
A; Gene: semD 

C; Superfamily: semaphorin 

Query Match 20.7%; Score 836; DB 2; Length 772; 

Best Local Similarity 30.3%; Pred. No. 6.6e-58; 

Matches 220; Conservative 124; Mismatches 279; Indels 102; Gaps 25; 

Qy 24 LLLPTTTAGGGGQGPMPRVRY-YAG— DERRALSFFHQKGLQDFDTLLLSGDGNTLYVGA 80 

: t I I : : | | : : | : : : | : I I I - :: I I I I I 

Db 14 VLLTARANYANGKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGA 73 

Qy 81 REAILALDIQDPGVPRLKNM— IPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTH 138 

:: I : :: : : I : I I I I :: | | : | : I I I I : I I : I I I I 

Db 74 KDHIFSFNLVN 1 KDFQKI VWPVS YTRRDECKWAGKDI LKECANFI KVLEAYNQTH 128 

Qy 139 LYTCGT FAFS PACTFI EL QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYS 195 

I I I I I I I I I I : I I : : : : : I : I : I I : I I : : I : I I I I I 

Db 129 LYACGTGAFHPICTYIEVGHHPEDNIFKLQDSHFENGRGKSPYDPKLLTASLLIDGELYS 188 

Qy 196 GTMNNFLGSEPILMRTLGSQPVLKTDNF-LRWLHHDASFVAA IPSTQWYFFF 247 

II : I : I : =1111 : : I : I I I : I I : : I I Mill 

Db 189 GTAADFMGRDFAI FRTLGDHHPIRTEQHDSRWL-NDPRFI SAHLI PESDNPEDDKVYFFF 247 

Qy 24 8 EETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLPFNV 303 

M : : : I : : : I I I I I I : I I I I I I I ! I : I : I : I I I : 

Db 248 RENAIGGEHSGKATHARIGQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNGIDTHFDE 307 

Qy 304 IRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSR 363 

: s i : : I I : I I I I : : : I I I I : I : I : I I I I I : : 
Db 308 LQDVFLMNS KDPKNPI VYGVFTTS SNI — FKGSAVCMYSMS DVRRVFLGP YAHRDGPNYQ 365 

Qy 364 WTTYRGPETNPRPGSC S VGP S S D KALT FMKDH FLMD EQ W GTPLLVKS 411 

11:1 I I : I I : | I | |:::|: 

Db 366 WVPYQGRVPYPRPGTCPSKTFGGFDSTKDLPDDVITFGRSHPAMYNPVFPINNRPIMIKT 425 



Qy 412 GVEY--TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAH-----LVEEIQLFP 464 



II I :: I : II : I I :: I I I :: I I : I I : I I : : I 

Db 426 DVNYQFTQIWDRVDAEDG-QYDVMFIGTDVGTVLKWSVPKETWHDLEEVLLEEMTVFR 484 

Qy 465 DPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTC 523 

: I : : : I : I : : : I : I I : : I I : I : : I : I I I I II : I I I I I 
Db 485 EPTTISAMELSTKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWDGSS 541 

Qy 524 C L L S APN L — N S WKQ DME RGN P E WAC AS GPMSRSLRPQSRPQIIKEVLAVPN 573 

I I : : i I : I : I I : II II : : | | 

Db 542 CSRYFPTAKRRTRRQDIRNGDPLTHCSDLEDHDNHHGP SLEE RIIYGVEN 591 

Qy 574 SILELPCPHLSALASYYW SHGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCW 625 

I I I I I I I I : : I I I : I 'II 

Db 592 SSTFLECSPKSQRALVYWQFQRRNRRSKREIRMGDHIIRTEQGLLLRSLQKKDSGNYLCH 651 

Qy 626 ATENGFSYPVISYWVDSQDQTLALDPELAGIPREHVKVPLTRVSGG AALAA 67 6 

11:11 IN I I I I I :: I : I :: : 

Db 652 AVEHGFM QTL-LKVTLEVIDTEHLEELLHKDDDGDGSKIKEMSSSMTP 698 

Qy 677 QQSYW 681 

I I 

Db 699 SQKVW 703 



RESULT 7 
D49423 

semaphorin III precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text_change 24-Sep-1999 
C; Accession: D49423 

R;Kolodkin, A.L.; Matthes, D.J.; Goodman, C.S. 
Cell 75, 1389-1399, 1993 

A; Title: The Semaphorin genes encode a family of transmembrane and secreted 
growth cone guidance molecules. 

A;Reference number: A49423; MUID:94094332; PMID:8269517 
A; Accession: D49423 

A; Status: preliminary; nucleic acid sequence not shown 
A; Molecule type: mRNA 
A;Residues: 1-771 <KOL> 

A;Cross-references: GB:L26081; NID:g799328; PIDN:AAA65938.1; PID:g436560 

C; Genetics: 

A; Gene: GDB:SEMA1 

A; Cross-references: GDB:283448 

C; Superfamily: semaphorin 

Query Match 20.5%; Score 825.5; DB 2; Length 771; 

Best Local Similarity 30.5%; Pred. No. 4.5e-57; 

Matches 216; Conservative 119; Mismatches 282; Indels 91; Gaps 24; 

Qy 35 GQGPMPRVRY-YAG--DERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQD 91 

I s : I I : : I : : : | : I I I : : I I I I I : : I : I : : 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFDLVN 84 

Qy 92 P GVP RL KNM — I P W PAS D RKK S E C AFKKK S NETQCFNFI RVL VS YN VT H L YT C GT FAF S P 14 9 

: I : I I I I I I : I : I I II : I I : I I I I I I I I I I I I 

Db 85 IKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHP 139 



Qy 150 ACTFIEL QDSYLLPISEDKA/MEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEP 206 

I I • i I : : : : I : I : I I : I I : : I : I I I I I I I : I : I : 

Db 140 ICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 

QY 207 I LMRTLGSQPVLKTDNF- LRWLHHDAS FVAA IPSTQWYFFFEETASEFDFFE 258 

: I I I I : : I : I I I : I I : : I | M I I I I I : : 

Db 200 AIFRTLGHHHPIRTEQHDSRWL-NDPKFISAHLISESDNPEDDKVYFFFRENAIDGEHSG 258 

Qy 259 RLHT S RVARVCKNDVGGEKLLQKPCWTT FLKAQLLCTQ PG Q L P FN VI RH AVL L PAD S 314 

• : I : : : I I I I I I : I I 1 I I I I I I : I : I : I I *' | : : : | : 
Db 259 KATHARI GQI CKNDFGGHRS LWKWTTFLKARLI CSVPGPNGI DTHFDELQDVFLMNFKD 318 

Qy 315 PTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNP 374 

I I : I I I I : : : I I I I : I : I : I I I I | : : | | : I | 
Db 319 PKNPWYGVFTTS SNI — FKGSAVCMYSMS DVRRVFLGP YAHRDGPN YQWVP YQGRVP YP 376 

Qy 375 RPGSC SVGPSSDKALTFMKDHFLMDEQW GTPLLVKSGVEY — TRLAV 420 

IIIH I I : I I : I I I |:::|: I I I:: | 

Db 377 RPGTCPSKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVIKTDVNYQFTQIW 436 

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAV VSGDSSAHLVEEIQLFPDPEPVRNLQLA 475 

: II • I I : 5 I I I : : I I I I : I I : : I : I : : : I : 

Db 437 DRVDAEDG-QYDVMFI GTDVGTVLKVVS I PKETWYDLEEVLLEEMTVFREPTAI SAMELS 495 

Qy 476 PTQGAVFVGFSGGWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPNL — N 532 

I : : : I : I I I I : I : : I : | | | | | | : | | | | : | | 
Db 496 TKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWD GSACSRYFPTAKRR 552 

Qy 533 SWKQDMERGNPEWACAS— GPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYY 590 

: : I I : I • I I : I : I : I I I II III 

Db 553 TRRQDIRNGDPLTHCSDLHHDNHHGHSPEER IIYGVENSSTFLECSPKSQRALVY 607 

Qy 591 WS HGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDS 642 

I Is IN: 11111:11 
Db 608 WQFQRRNEERKEEIRVDDHIIRTDQGLLLRSLQQKDSGNYLCHAVEHGFI 657 

Qy 643 QDQTLALDPELAGIPREHVKVPLTRVSGG AALAAQQSYW 681 

I I I I I | | | :: | : | :: | | 

Db 658 --QTL-LKVTLEVIDTEHLEELLHKDDDGDGSKTKEMSNSMTPSQKVW 702 



RESULT 8 
G01856 

semaphorin V - human 

C; Species: Homo sapiens (man) 

C;Date: 21-Dec-1996 #sequence_revision 06-Jun-1997 #text_change 24-Sep-1999 
C; Accession: G01856 
R;Sekido, Y. 

submitted to the EMBL Data Library, June 1995 
A; Reference number: G08634 
A; Accession: G01856 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-749 <SEK> 

A;Cross-references: EMBL:U28369; NID:g974283; PIDN:AAD09138.1; PID:g974284 
C; Superfamily: semaphorin 



Query Match 20.3%; Score 820; DB 2; Length 749; 

Best Local Similarity 31.2%; Pred. No. 1.2e-56; 

Matches 207; Conservative 116; Mismatches 254; Indels 86; Gaps 24; 

Qy 32 GGGGQGPMPRVRYYAGDERRALS F FHQKGLQDF DTLLLSGDGNTLYV 78 

II II I IN :l III I ||: : |:| 

Db 20 GLGSAAPSP PRLRLSFQELQAWH — GLQTFSLERTCCYQALLVDEERGRLFV 69 

Qy 79 GAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTH 138 
'I : :h: : : : : III : || : | | : | || ::: | : ,| ,, 
Db 70 GAENHVASLNLDN ISKRAKKLAWPAPVEWREECNWAGKDIGTECMNFVKLLHAYNRTH 127 

Qy 139 LYTCGTFAFSPACTFIEL---QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYS 195 

I I I I I I I I I : I : : : I : : : : I ! | : || : || | : : | | | | | | 
Db 128 LLACGTGAFHPTCAFVEVGHRAEEPVLRLDPGRIEDGKGKSPYDPRHRAASVLVGEELYS 187 

Qy 196 GTMNNFLGSEPILMRTLGSQPVLKTD-NFLRWLHHDASFVAAI-------PSTQVVYFFF 247 

I : : I : : I : I I : I I : I : : III:: | | | : | | | | 

Db 188 GVAADLMGRDFTIFRSLGQRPSLRTEPHDSRWL-NEPKFVKVFWIPESENPDDDKIYFFF 24 6 

Qy 248 EETASE-FDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQP GQLPFNV 303 

III I M I I I : : I : I I I I I : : I M I I I I I I : I : I : | | | : 

Db 247 RETAVEAAPALGRLS VS RVGQI CRNDVGGQRS LVNKWTT FLKARLVC S VP GVEGDTHFDQ 306 

Qy 304 IRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSR 363 

: : I I : I : I I I I : : : : II I I : I : I : I I | : 

Db 307 LQDVFLLS S RDHRT PLL YAVFST S S S I — FQGSAVC VYSMNDVRRAFLGP FAHKEGPMHQ 364 

Qy 364 WTTYRGPETNPRPGSC SVGPSSDKALTFMKDHFLMDEQW GTPLLVKS 411 

' : ' : ' UN I I | : | ::| || | : | ,| :s 

Db 365 WVSYQGRVPYPRPGMCPSKTFGTFSSTKDFPDDVIQFARNHPLMYNSVLPTGGRPLFLQV 424 

Qy 412 GVEY TRLAVETAQGLDGHSHLVMYLGTTTGSLHK — AWSG DSSAHLVEEIQLFP 464 

I I I = = I • III : l:::|| I : : I :| | : |:||: :| 

Db 425 GAN YT FTQI AADRVAAADGH- YDVLFI GT DVGTVLKVI S VPKGS RP SAEGLLLEELHVFE 483 

Qy 465 DPEPVRNLQIAPTQGAVFVGFSGGWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTC 523 

I I : : I : : : : : I | : : | - s I : I I II I I : I I I I 
Db 484 D S AAVT S MQ I S S KRH Q L YVAS RS AVAQ I ALHRCAAHGRVCT E C C LARD P YCAWD GVA 540 

Qy 524 C LL S APN LN S WKQ DME RGN P E WAC AS GPMSRSLRPQSRPQIIK- E VL AVPN S I L E L P C 580 

I 1= Ml: hi |: III ::: :| | | | | 

Db 541 CTRFQPSAKRRFRRQDVRNGDPSTLCSG DSSRPALLEHKVFGVEGSSAFLEC 592 

Qy 581 PHLSALASYYW SHGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFS 632 

'I I : l I : III:: hi I I I II: 

Db 593 EPRS LQARVEWTFQRAGVTAHTQVLAEERTERTARGLLLRRLRRRDSGVYLCAAVEQGFT 652 

Qy 633 YPV 635 

I : 

Db 653 QPL 655 



RESULT 9 
I48748 

semaphorin E - mouse 

C; Species: Mus musculus (house mouse) 



C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 24-Sep-1999 
C; Accession: I48748 

R;Puschel, A.W.; Adams, R.H.; Betz, H. 
Neuron 14, 941-948, 1995 

A; Title: Murine semaphorin D/collapsin is a member of a diverse gene family and 
creates domains inhibitory for axonal extension. 
A;Reference number: I48744; MUID:95267431; PMID:7748561 
A; Accession: I48748 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-751 <RES> 

A;Cross-references: EMBL:X85994; NID:g854331; PIDN:CAA59986.1; PID:g854332 
C; Genetics: 
A; Gene: semE 

C; Superfamily: semaphorin 

Query Match 19.9%; Score 802.5; DB 2; Length 751; 

Best Local Similarity 31.2%; Pred. No. 2.9e-55; 

Matches 219; Conservative 119; Mismatches 264; Indels 101; Gaps 26; 

QY 13 LLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRA LSFFHQKGLQDFDT 66 

I : I : I : : I I I I I I I I I I I : I : 

Db 9 LVGVFICSI CVRGSSQPQARV-YLTFDELRETKTSEYFSLSHQQ--LDYRI 56 

QY 67 LLLS GDGNTLYVGAREAI LALDI QDPGVPRLKNMI PWPAS DRKKS ECAFKKKSNETQC FN 12 6 

M : I : : I I I : : : I I : I : I : | : I I I I | | | | | | 

Db 57 LLMDEDQDRIYVGSKDHILSLNINNISQEPLS — VFWPASTIKVEECKMAGKDPTHGCGN 114 

Qy 127 FIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVM EGKGQSPFD 179 

I : I i : : : I I I I I I I : I I I I I I : : I I I : I I I I : | : 

Db 115 FVRVIQTFNRTHLYVCGSGAFSPVCTYLNRGRR SEDQVFMIDSKCESGKGRCSFN 169 

QY 180 PAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNF-LRWLHH DASFV 234 

I : I : : : I : I I : I : I : : : I : I : | : | | : | | | | : 

Db 170 PNVNTVSVMINEELFSGMY1DFMGTDAAI FRSLTKRMQLRTDQHNSKWLSEPMFVDAHVI 229 

QY 235 AAI PSTQWYFFFEETAS EFDFFERLHT S RVARVCKNDVGGEKLLQKKWTT FLKAQLL 292 

I : I I I I I : I : : : : I : I I : I I I I I : : | I I I I I I I I : I : 
Db 230 PDGTDPNDAKVYFFFKERLTDNNRSTKQIHSMIARICPNDTGGQRSLVNKWTTFLKARLV 289 

QY 293 CT Q P G - Q L P FN VI RH AVL L PAD S P TAP H I YAVFT S Q WQ VG GT R S S AVCAF S L L D I E R 348 

I : : I : I : = II I : I : I : I I : I : I I I I : I I I : 
Db 290 CSVTDEDGPETHFDELEDVFLLETDNPRTTLVYGIFTTSSSV — FKGSAVCVYHLSDIQT 347 

QY 349 VFKGKYKELNKETSRWTTYRGPETNPRPGSCSVGPSS DKALTFMKDHFLM 398 

III: : : I : I II I I : I I : I : I I :: : I I I 

Db 348 VFNGPF7\HKEGPNHQLISYQGRIPYPRPGTCPGGAFTPNMRTTKDFPDDWTFIRNHPLM 407 

Qy 399 DEQW— GTPLLVKSGVE— YTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAW S 449 

: I I : I : | : I I : : I I : | | I I : : I I I I : : I I I I 

Db 4 08 YNSISPIHRRPLIVRIGTDYKYTKIAVDRVNAADGRYH-VLFLGTDRGTVQKVWLPTNS 4 66 

QY 450 GDSSMLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVL 508 

I : : I I : : : I : I : : : : : : : : | : | | : | | : | : | | | | 

Db 467 SAS GEL I LEELEVFKNHVP I TTME I S S KKQQL YVS SNEGVSQVS LHRCHI YGTACADCCL 526 



Qy 509 ARDPHCAWDPESRTCCLLSAP--NLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIK 566 

1111=1111 ! I I I :||: Ml | : :::(:: 

Db 527 ARDPYCAWDGHS CSRFYPTGKRRSRRQDVRHGNPLTQCRGFNLKAY RNAAEIVQ 580 

Qy 567 EVLAVPNSILELPCPHLSALASYYW SHGPAAVPEASSTVYNGSLLLIVQDGVG 619 

ll:MIIII : I I I: III 

Db 581 YGVRNNSTFLECAPKSPQASIKWLLQKDKDRRKEGKLNERIIATSQGLLIRSVQDSDQ 638 

Qy 620 GLYQCWATENGFSYPVISYWVDSQDQTLA LDPELAGI 656 

IN I MM I MM MM: 

Db 639 GL YH C I AT EN S FK QT I AK I N FKVL D S EMVAV 669 



RESULT 10 
I48744 

semaphorin A - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 24-Sep-1999 
C; Accession: I48744 

R;Puschel, A.W.; Adams, R.H.; Betz, H. 
Neuron 14, 941-948, 1995 

A; Title: Murine semaphorin D/collapsin is a member of a diverse gene family and 
creates domains inhibitory for axonal extension. 

A; Reference number: I48744; MUID:95267431; PMID:7748561 

A; Accession: I48744 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-748 <RES> 

A;Cross-references: EMBL:X85990; NID:g854323; PIDN:CAA59982.1; PID:g854324 
C; Genetics: 
A; Gene: semA 

C;Superfamily: semaphorin 



Query Match 19.5%; Score 784.5; DB 2; Length 748; 

Best Local Similarity 31.1%; Pred. No. 7.7e-54; 

Matches 210; Conservative 117; Mismatches 281; Indels 67; Gaps 23; 



Qy 31 AGGGGQGP-MPRVR--YYAGDERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILAL 87 

Ml I : I I M : I : | : : : I I : : I : I I I : : | 

Db 19 AGLGDTAPNLPRLRLSFQELQARHGVRTFRLERTCCYEALLVDEERGRLFVGAENHVASL 78 

Qy 88 DIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAF 147 

: : : : MM : II : I I M I I : I : I : II || | I I I I 

Db 79 SLDN--ISKRAKKLAWPAPVEWREECNWAGKDIGTECMNFVRLLHAYNHTHLLACRTGAF 136 

Qy 148 SPACTFIELQDSYLLPIS--EDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSE 205 

II : I : I : : II 1 : : I : I I I : MM I I II : : I : 

Db 137 HPTCALWRWATAGGTHASTGPEKLEDGKGKTPYDPRHRPPSVLVGEELYSGVTADLMGRD 196 

Qy 206 PILMRTLGSQPVLKTD-NFLRWLHHDASFVAAI-------PSTQVVYFFFEETASE-FDF 256 

M M I MM:: III:: | | | : M I I I M I 

Db 197 FTIFRSLGQNPSLRTEPHDSRWL-NEPKFVKVFWIPESENPDDDKIYFFFRESAVEAAPA 255 

Qy 257 FERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQP---GQLPFNVIRHAVLLPAD 313 

I : II I : : I : II : I I : : I II I I I I II M M : I I I : : : I I : 

Db 256 MGRMSVSRVGQICRNDLGGQRSLVNKWTTFLKARLVCSVPGVEGDTHFDQLQDVFLLSSR 315 

Qy 314 SPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETN 373 

' : 1 1 II : : 1 : 1 1 II : 1 : | : | | | | : | : | : | 

Db 316 DRQTPLLYAVFST--SSGVFQGSAVCVYSMNDVRRAFLGPLPHKEGPTHQWVSYQGRVPY 373 

Qy 374 PRPGSC SVGPSSDKALTFMKDHFLMDEQVV GTPLLVKSGVEY--TRLA 419 

1 1 1 1 1 1 1 : 1 : : 1 | I | : I I | : : | | | : : | 

Db 374 PRPGMCPSKTFGTFSSTKDFPDDVIQFGRNHPLMYNPVLPMGGRPLFLQVGAGYTFTQIA 433 

Qy 420 VETAQGLDGHSHLVMYLGTTTGSLHK--AVVSG---DSSAHLVEEIQLFPDPEPVRNLQL 474 

: III : h::|| |:: I :| | :| |:||:|:| | : ::|: 

Db 434 ADRVAAADGH-YDVLFIGTDVGTVLKVISVPKGRRPNSEGLLLEELQVFEDSAAITSMQI 492 

Qy 475 APTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPNLNS 533 

: : : : 1 1 : : 1 : : 1 : 1 1 1 1 1 1 : 1 1 1 1 : | | 

Db 493 SSKRQQLYVASRAAVAQIALHRCTALGRACAECCLARDPYCAWD---GSACTRFQPTAKR 549 

Qy 534 --WKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYW 591 

: N: 1:1 1 II 1 1: : |:|| | : 

Db 550 RFRRQDIRNGDPSTLC-SGDSSHSV------LLEKKVLGVESGSAFLECEPRSLQAHVQW 602 

Qy 592 SHGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYP VISYW 639 

• 1 1 |||.. hi 1 II IN | | : 

Db 603 TFQGAGEAAHTQVLAEERVERTARGLLLRGLRRQDSGVYLCVAVEQGFSQPLRRLVLHVL 662 

Qy 640 VDSQDQTLALDPELA 654 

: 1 : II II 

Db 663 SAAQAERLARAEEAA 677 



RESULT 11 
I58169 

semaphorin III - mouse (fragment) 

C; Species: Mus musculus (house mouse) 

C;Date: 26-Jul-1996 #sequence_revision 26-Jul-1996 #text_change 24-Sep-1999 
C;Accession: I58169 

R;Messersmith, E.K.; Leonardo, E.D.; Shatz, C.J.; Tessier-Lavigne, M.; Goodman, 
C.S.; Kolodkin, A.L. 
Neuron 14, 949-959, 1995 

A; Title: Semaphorin III can function as a selective chemorepellent to pattern 
sensory projections in the spinal cord. 

A; Reference number: I58169; MUID:95267432; PMID:7748562 
A; Accession: I58169 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-666 <RES> 

A; Cross-references: GB:L40484; NID:g703189; PIDN:AAA73934.1; PID:g703190 

C; Genetics: 

A; Gene: SemaIII 

C; Superfamily: semaphorin 

Query Match 19.0%; Score 766; DB 2; Length 666; 

Best Local Similarity 31.4%; Pred. No. 1.9e-52; 

Matches 195; Conservative 102; Mismatches 232; Indels 92; Gaps 21; 

Qy 123 QCFNFI RVLVS YNVTHLYTCGTFAFS PACTFI EL QDS YLLPI SEDKVMEGKGQS PFD 179 

: I I I I : I I : II I I I I I I I I I I I I : I I : : : : : I : I : I I : I 

Db 7 ECANFIKVLEAYNQTHLYACGTGAFHPICTYIEVGHHPEDNIFKLQDSHFENGRGKSPYD 66 



Qy 180 PAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNF-LRWLHHDASFVAA-- 236 

,. 1 • • I : I I Mill I I : | : : | | | | • . i . . , . . , , . . , 

Db 67 PKLLTASLLIDGELYSGTAANE^GRDFAIFRTLGHHHPIRTEQHDSRWL-NDPRFISAHL 125 

Qy 237 IPSTQVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQL 291 

Db 126 IPESDNPEDDKVYFFFRENAIDGEHSG^THARIGQICKNDFGGH 185 

Qy 292 LCTQPG----QLPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIE 347 

Db 186 ICSVPGPNGIDTHFDELQDVFLMNSKDPKNPIVYGVFTTSSNI--FKGSAVCMYSMSDVR 243 

Qy 348 RVFKGKYKELNKETSRWTTYRGPETNPRPGSC SVGPSSDKALTFMKDHFLM 398 

Db 244 RVLLGPYAHRDGPNYQWPYQGRVPYPRPGTCPSKTFGGFDkKDLPDDviiFARsipAi 303 

Qy 399 DEQVV---GTPLLVKSGVEY--TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSS 453 

Db 304 YNPVFPINNRPmiKTDWYQFiQIWDRVDAEDG-QYDWFUiDVGT^UsvPKET 362 

Qy 454 AH-----LVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCV 507 

Db 363 WHDLEEVLLEEMTVFREPTTISAMELSTKQ^ 422 

Qy 508 LARDPHCAWDPESRTCCLLSAPNL NSWKQDMERGNPEWACAS GPMSRSLR 557 

Db 423 ^RDPYCAWDGS S ---CSRYFP T AKRRTRRQDIRNGDPLTHCSDLQHHDNHHGP---s!e 476 

Qy 558 PQSRPQIIKEVLAVPNSILELPCPHLSALASYYWS HGPAAVPEAS STVYNGS 609 

Db 477 E RIIYGV ENSSTFLECSPKSQRALVYWQFQRRNEDRKEEIKMGDHIIRTEQGL 529 

Qy 610 LLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREHVKVPLTRVS 669 

Db 530 LLRSLQKKDSGNYLCHAVEHGFM QTL-LKVTLEVIDTEHLEELLHKDD 576 

Qy 670 GG AALAAQQSYW 681 

I : : : | | 

Db 577 DGDGSKIKEMSSSMTPSQKVW 597 

RESULT 12 
JC5928 

semaphorin F precursor - human 
C; Species: Homo sapiens (man) 

c;l":»io;;f SL'e 08-M, y -l 99 8 #text_ch ang e n-Hov-2000 

ZltlllTol.~s JC5926; ■" 1BI9 "»"«» ™ID;946427 8 
A; Status : nucleic acid sequence not shown 
A; Molecule type: mRNA 
A; Residues: 1-1074 <SIM> 

A; Cross-references: GB:U52840; NID:g2772583; PIDN:AAC09473.1; PID:g2772584 






A; Experimental source: brain 

C; Comment: This protein disrupts normal brain development and leads to some of 
the features of Cri-du-chat. 
C; Genetics : 
A; Gene: semaf 

C; Superfamily: human semaphorin F; thrombospondin type 1 repeat homology 
F;1-20/Domain: signal sequence #status predicted <SIG> 
F;50-533/Domain: semaphorin #status predicted <SEM> 
F;840-896/Domain: thrombospondin type 1 repeat homology <THR3> 
F;971-993/Domain: transmembrane #status predicted <TMM> 



Query Match 16.8%; Score 677.5; DB 2; Length 1074; 

Best Local Similarity 34.1%; Pred. No. 4.1e-45; 

Matches 170; Conservative 87; Mismatches 207; Indels 35; Gaps 18; 



Qy 56 FHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAF 115 

II Ml 1 1 1 II : 1 : : 1 | : | : | | 

Db 50 FRAKNAADFSQLTFDPGQKELWGARNYLFRLQLEDLS LIQAVEWECDEATKKACYS 106 

Qy 116 KKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQ 175 

1 1 1 1 : 1 1 : 1 1 I I : 1 : 1 1 1 1 | 1 : | | | | | | | : s | : 

Db 107 KGKSKE-ECQNYIRVLL-VGGDRLFTCGTNAFTPVCTNRSL--SNLAEIHDQ--ISGMAR 160 

Qy 176 SPFDPAHKHTAVL-VDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNF-LRWLHHDASF 233 

1 : 1 1 1 1 : 1 1 1 1 : 1 : 1 1 : 1 : 1 : M 1 1 : 1 : : | | : : : | 

Db 161 CPYSPQHNSTALLTAGGELYAATAMDFPGRDPAIYRSLGILPPLRTAQYNSKWL-NEPNF 219 

Qy 234 VAAIPSTQVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLC 293 

1 : : 1 1 1 1 1 1 1 I : : llll|||||:|| I I : 1 1 M : II : 1 1 

Db 220 VSSYDIGNFTYFFFRENAVEHDCGKTVF-SRAARVCKNDIGGRFLLEDTWTTFMKARLNC 278 

Qy 294 TQPGQLPF--NVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFK 351 

: : 1 1 : : N I : : | | II : | | : | : I I 1 1 1 : I I : | 

Db 279 SRPGEVPFYYNELQSTFFL----PELDLIYGIFTT--NVNSIAASAVCVFNLSAIAQAFS 332 

Qy 352 GKYKELNKETSRWTTYRGPETNPRPGSCSVG P S S DKALT FMKDH FLMDEQ WG 404 

1:1 1 1 1 I • . i - i . II... 

Db 333 GPFKYQENSRSAWLPYPNPNPHFQCGTVDQGLYVNLTERNLQDAQKFILVHEW-QPVTT 391 

Qy 405 TPLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAVV--SGDSSAHLVEEIQL 462 

1 :: HI: II : 1:: II 1 1 : : I | : ||: |:|l|:| 

Db 392 VPSFMEDNSRFSHVAVDWQGREALVHII-YLATDYGTIKKVRVPLNQTSSSCLLEEIEL 450 

Qy 463 FPD--PEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPES 520 

H= M:|:||: :| :||l 1 ::| 1 1 : I: |:||:| |l 

Db 451 FPERRREPIRSLQILHSQSVLFVGLREHWKIPLKRCQFYRTRSTCIGAQDPYCGWDWM 510 

Qy 521 RTCCLL-SAPNLNSWKQDM 538 

: 1 1 : : : 1 : I : 

Db 511 KKCTSLEESLSMTQWEQSI 529 





RESULT 13 
JH0798 

fasciclin IV precursor - American bird grasshopper 

C; Species: Schistocerca americana (American bird grasshopper) 

C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 07-May-1999 



C; Accession: JH0798 

R;Kolodkin, A.; Matthes, D.J.; O'Connor, T.P.; Patel, N.H.; Admon, A.; Bentley, 
D.; Goodman, C.S. 
Neuron 9, 831-845, 1992 

A; Title: Fasciclin IV: Sequence, expression, and function during growth cone 
guidance in the grasshopper embryo. 

A;Reference number: JH0798; MUID:93040225; PMID:1418998 
A; Accession: JH0798 
A; Molecule type: mRNA 
A; Residues: 1-730 <KOL> 

A; Cross-references: GB:L00709; NID:g160844; PID:g160845 
A; Experimental source: embryo 

C; Comment: This protein plays a role in growth cone guidance in the developing 
central nervous system. 

C;Keywords: glycoprotein; transmembrane protein 
F; 1-22/Domain: signal sequence #status predicted <SIG> 
F;23-730/Product: fasciclin IV #status predicted <MAT> 
F;23-627/Domain: extracellular #status predicted <EXT> 
F;628-652/Domain: transmembrane #status predicted <TMM> 
F; 653-730/Domain: intracellular #status predicted <INT> 

F;44,71,163,267,360,539/Binding site: carbohydrate (Asn) (covalent) #status 
predicted 



Query Match 16.3%; Score 657.5; DB 2; Length 730; 

Best Local Similarity 29.5%; Pred. No. 8.9e-44; 

Matches 213; Conservative 115; Mismatches 246; Indels 149; Gaps 34; 



Qy 41 RVRYYAGDERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNM 100 

It: : 1:1 1 II 1 hi MM : : ::| : 

Db 37 RVQRFLGNESHKDHF KLLEKDHNSLLVGARNIVYNI SLRDL-TEFTEQR 84 

Qy 101 IPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQD-S 159 

1 1 2 1 : 1 1 1 1 1 1 1 : II 1 1 : : Nihil | : | 

Db 85 IEWHSSGAHRELCYLKGKS-EDDCQNYIRVLAKIDDDRVLICGTNAYKPLCRHYALKDGD 143 

Qy 160 YLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLK 219 

1 = = s 1 1 1 : 1 1 1 1 1 1 II: : 1 1 1 1 1 : : I I : : | : : | | : 

Db 144 YW EKEYEGRGLCPFDPDHNSTAIYSEGQLYSATVADFSGTDPLIYR GPLRT 195 

Qy 220 TDNFLRWLHHDASFVAAIPSTQVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLL 279 

* 1 : 1 : -II : : : 1 1 1 1 1 1 1 : : 1 1 1 1 1 1 1 1 : 1 1 1 

Db 196 ERSDLKQL-NAPNFVNTMEYNDFIFFFFRETAVEYINCGKAIYSRVARVCKHDKGGPHQF 254 

Qy 280 QKKWTTFLKAQLLCTQPGQLP--FNVIRHAVLLPAD S P TAP H I YAVFT S QWQ V 330 

MI:IM::| 1: II | || |: :| II III: 1 

Db 255 GDRWTSFLKSRLNCSVPGDYPFYFNEIQST----SDIIEGNYGGQVEKLIYGVFTT--PV 308 

Qy 331 GGTRS SAVCAFS LLDI ERVFKGKYKELNKET SRW TTYRGPETNPRPGSCSVGPS 384 

1 1 I 1 1 M : 1 1 1 : I I || : : | | I I I I | | | 

Db 309 NSIGGSAVCAFSMKSILESFDGPFKEQETMNSNWLAVPSLKVPE--PRPGQC-VNDSRTL 365 

Qy 385 SDKALTFMKDHFLMDEQVVG---TPLLVKSGVEY--TRLAV-ETAQGLDGHSHLVMYLGT 438 

1 ' = 1 : 1 1 1 II 1 1 1 : 1 : : :: I 1 : : I I : : 1 1 : : I : : : I I 

Db 366 PDVSVNFVKSHTLMDEAVPAFFTRPILIRISLQYRFTKIAVDQQVRTPDGKAYDVLFIGT 425 

Qy 439 TTGSLHKAV VSGDSS AHLVEEIQLFPDPEPVRNLQLAPTQG--AVFVGFS GG-- 488 

Db 426 /IOC 485 

Qy 489 - VWRVP RAN C S VYE S C VDC VL ARD P H CAWD PESRTCCLL S APN LN S WK QDMERGNP 543 

Db 486 LAIKLHRCGSDKTTNrRKrvST.nDPYrAWDNVPT \^r tr V7wrc q id pitatc a c* vm~> tpt amt c< t 
j.vxji±j.\\^\jo i-'ivj. ± in ^imii^ v o j-i^i^rr i ^swPii uvi v JujIjivo 1/iVborUWijAbKRRF IQNI SLGEH 545 

Qy 544 EWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASS 603 

: 1 1 |||: | : : | | ' 1 1 : 

Db 546 566 

Qy 604 TVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPEL AGIPR 658 

1=1 II ::l :||: |: 

Db 567 596 

Qy 659 EHVKVPLTRVS GGAALAAQQSYWPHFVTVTVLFALVLS--GALII-LVASPLRALRA 712 

: 1 : 1 1 I : | : | | : | : | : | | | | | : : : : | : | 

Db 597 SNV-IPNTLAEINHAGSKLPSSQEKLPIYTAETLTIAIVTSCLGALWGFISGFLFSRRC 655 

Qy 713 RGK 715 

1 1 : 

Db 656 RGE 658 





RESULT 14 
T27165 

hypothetical protein Y54E5B.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T27165 
R;Lennard, N. 

submitted to the EMBL Data Library, October 1998 
A; Reference number: Z20321 
A; Accession: T27165 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-712 <WIL> 

A;Cross-references: EMBL:AL032653; PIDN:CAA21714.1; GSPDB:GN00019; CESP:Y54E5B.1 

A; Experimental source: clone Y54E5B 

C; Genetics: 

A; Gene: CESP:Y54E5B.1 

A; Map position: 1 

A;Introns: 36/1; 70/2; 85/3; 194/2; 221/1; 253/3; 301/1; 352/3; 522/2; 560/1; 
599/3; 655/3 



Query Match 15.4%; Score 621.5; DB 2; Length 712; 

Best Local Similarity 31.1%; Pred. No. 6.2e-41; 

Matches 196; Conservative 97; Mismatches 233; Indels 105; Gaps 29; 

Qy 68 LLS GDGNTLYVGAREAI LALDI QDPGVPRLKNMI PWPAS DRKKS ECAFKKKSNETQC FNF 127 

M: M::| : I : I : | | II I II :| I |: 

Db 57 L LAAD GD S LLVGARNAVYNL S L S T L S V NHKIDWKPPAEHIEECIMKGKS-KTDCQNY 112 

QY 128 I RVLVS YN VTHLYTCGTFAFS PACTFI ELQDS YLLPI SEDKVMEGKGQSPFDPAHKHTAV 187 

llll : III Mil I : : | : :|:| ||:|| | :|: 

Db 113 I RVLARKS AGVS LVCGTHAFS PKCRE YT VTE FGIRNTRQFDGQGISPYDPKHNSSAL 169 



Qy 188 LVDG MLYSGTMNNFLGSEPILMR-TLGSQPVLKTDNFLRWLHHDA SFVAAIP 238 



II I : I : : | : | : : : : | | : I I : : I : | | : | | | 

Db 170 YVPGTNQLFVATVTDFVGNDALIYRKTIDETPSSKSAANIRTQSYDARVLNAPNFVATFA 229 

QY 239 STQWYFFFEETASE-FDFFERLHT-SRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQP 296 

: I I I : I I I I I I I : I I I I I I I II I I : : : I I : :.| | | : | | : | 

Db 230 YKEHVYFWFREIASEAIDNNEEPQIYARVARVCKNDKGGARPANERWTSYLKARLNCSLP 289 

QY 297 - GQLP- - FNVI RHAVLL PADS PTAPH- 1 YAVFT SQWQVGGTRS S AVCAFSLLDI ERVF- K 351 

I I I I : : I I II: I : I I I : : I I I I I I I : I | 

Db 290 S GS S P F YFNELK-AVS DP I DAGNNNHWYT VFS T -- PDS DVRMS AVCKFSMKKI REEFDN 346 

QY 352 GKYKELNKETSRWTTY-RGPETNPRPGSCSVGPSSDK ALTFMKDHFLMDEQV — VG 4 04 

I • I I II : I I I I I I I I III : : I : I I : : | 

Db 347 GTFKHQNNAQSMWMAFNRNEVPKPRPGSCS — PDSTKLPENTVSFILHHPLLHRPIPSVA 404 

QY 4 05 TPLLVKSG VEYTRLAV-ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQ 4 61 

MM: : I : : | : : | | : : : : : : | | : | : | | | | : | : : : 

Db 4 05 APLLVEGADRADLTQITVLPRVRAVGGHNYDILFIGTSDGKVLK-WEVDGNATVIQSAT 4 63 

Qy 462 LFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESR 521 

: I I : I I I : : I : : : : I I I : II II MINIM 
Db 464 VFQRGVPIVN--LLTTKESWIVSADEIASLPVHNCAQQTSCSKCVQLQDPHCAWDSSIA 521 

Qy 522 TCCLLSAPNLNSWK QDMERGNPEWACASG PMSRSLRPQ 559 

I : I I 1*1.1 I I I Mill: 

Db 522 RCV HGGSWTGDQFIQNMVFGQSE-QCPEGIIVREVFDDNESEAQPEAVSRSGYPK 575 

Qy 560 SRPQIIKEVLAVPNSILELPC PHLSALA S YYWSHGPAA 597 

I : : I I:: I II II: I I I I I 

Db 576 EH S T I T WLVAAVAS L I S L 1 1 GAFI GI RVNRWAAT S E PHRS AS S T S GS D YDS FGRARLTR 635 

Qy 598 VPEASSTVYNGSLLLIV 614 

II:: : I II::: 
Db 636 HDSLTTATKVDHGFVPQSKQSVDATSLVMSI 666 



RESULT 15 
A49423 

semaphorin I precursor - beetle (Tribolium confusum) 
C; Species: Tribolium confusum 

C;Date: 19-Mar-1997 #sequence_revision 19-Mar-1997 #text_change 07-May-1999 
C;Accession: A49423 

R;Kolodkin, A.L.; Matthes, D.J.; Goodman, C.S. 
Cell 75, 1389-1399, 1993 

A; Title: The Semaphorin genes encode a family of transmembrane and secreted 
growth cone guidance molecules. 

A; Reference number: A49423; MUID:94094332; PMID:8269517 
A; Accession: A49423 

A; Status: preliminary; nucleic acid sequence not shown; not compared with 
conceptual translation 

A; Molecule type: mRNA 

A; Residues: 1-711 <KOL> 

A; Cross-references: GB:L26080 

Query Match 15.1%; Score 609.5; DB 2; Length 711; 

Best Local Similarity 31.4%; Pred. No. 5.5e-40; 

Matches 169; Conservative 103; Mismatches 214; Indels 53; Gaps 22; 



Qy 67 LLLSGDGNTLYVGAREAI LALDIQDPGVPRLKNMI PWPASDRKKSECAFKKKSNETQCFN 126 

: : I : I = = I I I : III | Mhll I I I : : : II 

Db 50 IVLNQDETSILVGGRNRVYNLSIFDLS-ERKGGRIDWPSSDAHGQLCILKGKTDD-DCQN 107 

Qy 127 FIRVLVSYNVTHLYTCGTFAFSPAC-TFIELQDSYLLPISEDKVMEGKGQSPFDPAHKHT 185 

: I I : I I I M I : : I I I : : | | : : I : | | I I : : I I I 
Db 108 YI RI L YS SEPGKLVI CGTNS YKPLCRT YAFKEGKYLV EKEVEGIGLCPYNPEHNST 163 

QY 186 AVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPSTQWYF 245 

: I = I I : I I : : I I : I : : I : I : I : I : : I I : : : : | 

Db 164 SVSYNGQLFSATVADFSGGDPLIYR EPQRTELSDLKQL-NAPNFVNSVAYGDYIFF 218 

QY 246 FFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLPF 301 

I : I I I I : : : I I I I I I I I : I I I : : I I : I I I I : I I : I I : | | 
Db 219 FYRETAVE YMNCGKVI YS RVARVCKDDKGGPHQS RDRWT S FLKARLNCS I P GE YP FYFDE 27 8 

QY 3 02 NVI RHAVLLPADS PTAPHI YAVFTSQWQVGGTRS SAVCAFSLLDI ERVFKGKYKE 356 

: : : : I : I I : I : I I I : I I : : I I I I I : I : I 

Db 279 IQSTSDIVEGRY NS DD S KI I YGI LTT — PVNAI GGS AI CAYQMADI LRVFEGS FKH 332 

QY 357 LNKETSRWTTYRGPET NPRPGSCSVGPS SDKALTFMKDHFLMDE — QWGTPLL 408 

II I : I I I I I I I I I : I : I I | | : : : | | : | 

Db 333 QETINSNWLPV— PQNLVPEPRPGQC-VRDSRILPDKNVNFIKTHSLMEDVPALFGKPVL 389 

QY 4 09 VKSGVEY — TRLAVETAQGLDGHSHL-VMYLGTTTGSLHKAV — VSGDSSAHLVEEIQ — 461 

I : : : I I : h : : | | : I : I I I : I I I : | | : : 

Db 390 VRVSLQYRFTAI TVDPQVKTINNQYLDVLYIGTDDGKVLKAVNIPKRHAKALLYRKYRTS 449 

QY 4 62 LFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESR 521 

: I II: |::M I I I : :|: I I I I : I I I I I I I : 

Db 450 VHPHGAPVKQLKIAPGYGKWWGKDEIRLANLNHCASKTRCKDCVELQDPHCAWDAKQN 509 

Qy 522 TCCLLSAPNLNSWK QDMERGNPEWACAS GPMS RS LRPQS RPQI I KEVLAVPNS I LE 577 

I : I : I = : I I : I I : II : : : : I : : : I I I I 

Db 510 LC VSIDTVTSYRFLIQDWRGDDN-KCWSPQTDKKTVI KNKPSEVEN — EITNSIDE 563 



Search completed: May 5, 2004, 17:29:44 
Job time : 23 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
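
The "sw model" named here presumably refers to Smith-Waterman local alignment, and the run parameters in this header list gap penalties of 10.0 (open) and 0.5 (extend). The sketch below shows affine-gap local alignment scoring under those assumptions. The function name, the flat match/mismatch scores (a stand-in for the BLOSUM62 table) and the convention that a gap of length k costs gap_open + (k - 1) * gap_extend are illustrative choices, not GenCore's documented behaviour.

```python
def sw_affine_score(a, b, match=2.0, mismatch=-1.0, gap_open=10.0, gap_extend=0.5):
    """Best local alignment score of a and b with affine gap penalties (Gotoh recurrences)."""
    neg_inf = float("-inf")
    m = len(b)
    h_prev = [0.0] * (m + 1)      # best score of an alignment ending at (i-1, j)
    f_prev = [neg_inf] * (m + 1)  # same, but ending with a gap that consumes a[i-1]
    best = 0.0
    for i in range(1, len(a) + 1):
        h_cur = [0.0] * (m + 1)
        f_cur = [neg_inf] * (m + 1)
        e = neg_inf               # best score ending with a gap that consumes b[j-1]
        for j in range(1, m + 1):
            # Either open a new gap from H or extend the gap already running.
            e = max(h_cur[j - 1] - gap_open, e - gap_extend)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_extend)
            s = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: a score never drops below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


print(sw_affine_score("ACGT", "ACGT"))  # 8.0
```

Swapping the flat scores for a substitution-matrix lookup would bring the sketch closer to the BLOSUM62 setting reported in the header; the recurrences stay the same.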



Run on: May 5, 2004, 17:28:47 ; Search time 48 Seconds 

(without alignments) 
4394.610 Million cell updates/sec 
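
The throughput figure is consistent with one dynamic-programming cell per (query residue, database residue) pair. The sketch below assumes that definition and uses the query length (761), the database size listed in the run parameters (277189581 residues) and the 48-second search time; the function name is hypothetical.

```python
def million_cell_updates_per_sec(query_len, db_residues, seconds):
    """Cells in the query-by-database matrix divided by search time, in millions per second."""
    return query_len * db_residues / seconds / 1e6


rate = million_cell_updates_per_sec(761, 277_189_581, 48)
print(f"{rate:.3f}")  # 4394.610
```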

Title: US-10-015-391A-277 
Perfect score: 4031 

Sequence: 1 MALPALGLDPWSLLGLFLFQ...CRTSASDVDADNNCLGTEVA 761 

Scoring table: BLOSUM62 
               Gapop 10.0 , Gapext 0.5 

Searched: 1138120 seqs, 277189581 residues 

Total number of hits satisfying chosen parameters: 1138120 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
                 Maximum Match 100% 
                 Listing first 45 summaries 

Database : Published_Applications_AA:* 



 1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:* 
 2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:* 
 3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:* 
 4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:* 
 5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:* 
 6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:* 
 7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:* 
 8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:* 
 9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:* 
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:* 
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:* 
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:* 
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:* 
14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:* 
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:* 
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:* 
17: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:* 
18: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
             100.0    761  14  US-10-015-391A-277  Sequence 277, App 
  31   4031  100.0    761  14  US-10-017-407A-277  Sequence 277, App 
  32   4031  100.0    761  14  US-10-011-833A-277  Sequence 277, App 
  33   4031  100.0    761  14  US-10-006-041A-277  Sequence 277, App 
  34   4031  100.0    761  14  US-10-015-822A-277  Sequence 277, App 
  35   4031  100.0    761  14  US-10-015-387A-277  Sequence 277, App 
  36   4031  100.0    761  14  US-10-006-130A-277  Sequence 277, App 
  37   4031  100.0    761  14  US-10-006-172A-277  Sequence 277, App 
  38   4031  100.0    761  14  US-10-017-253A-277  Sequence 277, App 
  39   4031  100.0    761  14  US-10-015-392A-277  Sequence 277, App 
  40   4031  100.0    761  14  US-10-017-306A-277  Sequence 277, App 
  41   4031  100.0    761  14  US-10-017-867A-277  Sequence 277, App 
  42   4031  100.0    761  14  US-10-012-064A-277  Sequence 277, App 
  43   4031  100.0    761  14  US-10-013-909A-277  Sequence 277, App 
  44   4031  100.0    761  14  US-10-015-671A-277  Sequence 277, App 
  45   4031  100.0    761  14  US-10-015-610A-277  Sequence 277, App 



ALIGNMENTS 



RESULT 1 

US-09-759-130B-389 

; Sequence 389, Application US/09759130B 
; Publication No. US20030022279A1 
; GENERAL INFORMATION: 

; APPLICANT: Millennium Pharmaceuticals, Inc. 



APPLICANT: McCarthy, Sean A 
APPLICANT: Fraser, Christopher C 
APPLICANT: Sharp, John D 
APPLICANT: Barnes, Thomas S 
APPLICANT: Kirst, Susan J 
APPLICANT: Mackay, Charles R 
APPLICANT: Myers, Paul S 
APPLICANT: Leiby, Kevin R 
APPLICANT: Wrighton, Nicolas 
APPLICANT: Goodearl, Andrew 
APPLICANT: Holtzman, Douglas A 

TITLE OF INVENTION: NOVEL GENES ENCODING PROTEINS HAVING 

TITLE OF INVENTION: PROGNOSTIC, DIAGNOSTIC, PREVENTIVE, THERAPEUTIC, AND OTHER 

TITLE OF INVENTION: USES 
FILE REFERENCE: MPI00-535OMNIM 
CURRENT APPLICATION NUMBER: US/09/759,130B 
CURRENT FILING DATE: 2002-09-16 
PRIOR APPLICATION NUMBER: US 09/479,249 
PRIOR FILING DATE: 2000-01-07 
PRIOR APPLICATION NUMBER: US 09/559,497 
PRIOR FILING DATE: 2000-04-27 
PRIOR APPLICATION NUMBER: US 09/578,063 
PRIOR FILING DATE: 2000-05-24 
PRIOR APPLICATION NUMBER: US 09/333,159 
PRIOR FILING DATE: 1999-06-14 
PRIOR APPLICATION NUMBER: US 09/596,194 
PRIOR FILING DATE: 2000-07-14 
PRIOR APPLICATION NUMBER: US 09/342,364 
PRIOR FILING DATE: 1999-06-29 
PRIOR APPLICATION NUMBER: US 09/608,452 
PRIOR FILING DATE: 2000-06-30 
PRIOR APPLICATION NUMBER: US 09/393,996 
PRIOR FILING DATE: 1999-09-10 
PRIOR APPLICATION NUMBER: US 09/602,871 
PRIOR FILING DATE: 2000-06-23 
PRIOR APPLICATION NUMBER: US 09/420,707 
PRIOR FILING DATE: 1999-10-19 
NUMBER OF SEQ ID NOS: 460 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 389 
LENGTH: 761 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-759-130B-389 

Query Match 100.0%; Score 4031; DB 10; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60 

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120 

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180 

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240 

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300 

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360 

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420 

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480 

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540 

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600 

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660 

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720 

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761 
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761 




RESULT 2 
US-09-946-374-277 





Sequence 277, Application US/09946374 
Publication No. US20030073129A1 
GENERAL INFORMATION: 

APPLICANT: Baker, Kevin P. 

APPLICANT: Botstein, David 

APPLICANT: Desnoyers, Luc 

APPLICANT: Eaton, Dan L. 

APPLICANT: Ferrara, Napoleone 

APPLICANT: Fong, Sherman 



APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 
APPLICANT: Roy, Margaret Ann 
APPLICANT: Smith, Victoria 
APPLICANT: Stewart, Timothy A. 
APPLICANT: Tumas, Daniel 
APPLICANT: Watanabe, Colin K. 
APPLICANT: Williams, P. Mickey 
APPLICANT: Wood, William I. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C1 

CURRENT APPLICATION NUMBER: US/09/946,374 
CURRENT FILING DATE: 2001-09-04 
PRIOR APPLICATION NUMBER: 60/098716 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098723 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098749 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098750 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098803 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098821 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098843 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/099536 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099596 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099598 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099602 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099642 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099741 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099754 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099763 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099792 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099808 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099812 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099815 



PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099816 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/100385 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100388 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100390 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100584 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100627 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100661 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100662 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100664 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100683 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100684 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100710 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100711 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100848 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100849 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100919 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100930 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/101014 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101068 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101071 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101279 
PRIOR FILING DATE: 1998-09-22 
PRIOR APPLICATION NUMBER: 60/101471 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101472 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101474 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101475 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101476 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101477 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101479 
PRIOR FILING DATE: 1998-09-23 



PRIOR APPLICATION NUMBER: 60/101738 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101741 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101743 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101915 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101916 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/102207 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102240 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102307 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102330 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102331 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102484 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102487 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102570 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102571 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102684 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102687 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102965 
PRIOR FILING DATE: 1998-10-02 
PRIOR APPLICATION NUMBER: 60/103258 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103314 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103315 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103328 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103395 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103396 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103401 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103449 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103633 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103678 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103679 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103711 



; PRIOR FILING DATE: 1998-10-08 

; PRIOR APPLICATION NUMBER: 60/104257 

; PRIOR FILING DATE: 1998-10-14 

; PRIOR APPLICATION NUMBER: 60/104987 

; PRIOR FILING DATE: 1998-10-20 

; PRIOR APPLICATION NUMBER: 60/105000 

; PRIOR FILING DATE: 1998-10-20 

; PRIOR APPLICATION NUMBER: 60/105002 

; PRIOR FILING DATE: 1998-10-20 

; PRIOR APPLICATION NUMBER: 60/105104 

; PRIOR FILING DATE: 1998-10-21 

; PRIOR APPLICATION NUMBER: 60/105169 

; PRIOR FILING DATE: 1998-10-22 

; PRIOR APPLICATION NUMBER: 60/105266 

; PRIOR FILING DATE: 1998-10-22 

; PRIOR APPLICATION NUMBER: 60/105693 

; PRIOR FILING DATE: 1998-10-26 

; PRIOR APPLICATION NUMBER: 60/105694 

; PRIOR FILING DATE: 1998-10-26 

; PRIOR APPLICATION NUMBER: 60/105807 

Query Match 100.0%; Score 4031; DB 10; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60 

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120 

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180 

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240 

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300 

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360 

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420 

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480 

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540 

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600 

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660 

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720 

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761 
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761 



RESULT 3 

US-10-401-053A-2 

; Sequence 2, Application US/10401053A 

; Publication No. US20040052782A1 

; GENERAL INFORMATION: 

; APPLICANT: Kikutani, H. et al 

; TITLE OF INVENTION: Costimulatory Molecules and Uses Thereof 
; FILE REFERENCE: 1/1311 

; CURRENT APPLICATION NUMBER: US/10/401,053A 

; CURRENT FILING DATE: 2003-03-27 

; PRIOR APPLICATION NUMBER: DE 102 13 762.5 

; PRIOR FILING DATE: 2002-03-26 

; PRIOR APPLICATION NUMBER: 60/371,050 

; PRIOR FILING DATE: 2002-04-09 

; NUMBER OF SEQ ID NOS: 14 

; SOFTWARE: PatentIn Ver. 2.1 

; SEQ ID NO 2 

; LENGTH: 761 

; TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-401-053A-2 

Query Match 100.0%; Score 4031; DB 12; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60 

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120 

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180 

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240 

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300 

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360 

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420 

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480 

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540 

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600 

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660 

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720 

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761 
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761 



RESULT 4 

US-10-006-485A-277 

Sequence 277, Application US/10006485A 
Publication No. US20030064062A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 



APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C9 

CURRENT APPLICATION NUMBER: US/10/006,485A 

CURRENT FILING DATE: 2001-12-06 

PRIOR APPLICATION NUMBER: 60/098716 

PRIOR FILING DATE: 1998-09-01 

PRIOR APPLICATION NUMBER: 60/098723 

PRIOR FILING DATE: 1998-09-01 

PRIOR APPLICATION NUMBER: 60/098749 

PRIOR FILING DATE: 1998-09-01 

PRIOR APPLICATION NUMBER: 60/098750 

PRIOR FILING DATE: 1998-09-01 

PRIOR APPLICATION NUMBER: 60/098803 

PRIOR FILING DATE: 1998-09-02 

PRIOR APPLICATION NUMBER: 60/098821 

PRIOR FILING DATE: 1998-09-02 

PRIOR APPLICATION NUMBER: 60/098843 

PRIOR FILING DATE: 1998-09-02 

PRIOR APPLICATION NUMBER: 60/099536 

PRIOR FILING DATE: 1998-09-09 

PRIOR APPLICATION NUMBER: 60/099596 

PRIOR FILING DATE: 1998-09-09 

PRIOR APPLICATION NUMBER: 60/099598 

PRIOR FILING DATE: 1998-09-09 

PRIOR APPLICATION NUMBER: 60/099602 

PRIOR FILING DATE: 1998-09-09 

PRIOR APPLICATION NUMBER: 60/099642 

PRIOR FILING DATE: 1998-09-09 

PRIOR APPLICATION NUMBER: 60/099741 

PRIOR FILING DATE: 1998-09-10 

PRIOR APPLICATION NUMBER: 60/099754 

PRIOR FILING DATE: 1998-09-10 

PRIOR APPLICATION NUMBER: 60/099763 

PRIOR FILING DATE: 1998-09-10 

PRIOR APPLICATION NUMBER: 60/099792 

PRIOR FILING DATE: 1998-09-10 

PRIOR APPLICATION NUMBER: 60/099808 

PRIOR FILING DATE: 1998-09-10 

PRIOR APPLICATION NUMBER: 60/099812 

PRIOR FILING DATE: 1998-09-10 

PRIOR APPLICATION NUMBER: 60/099815 

PRIOR FILING DATE: 1998-09-10 

PRIOR APPLICATION NUMBER: 60/099816 

PRIOR FILING DATE: 1998-09-10 

PRIOR APPLICATION NUMBER: 60/100385 

PRIOR FILING DATE: 1998-09-15 

PRIOR APPLICATION NUMBER: 60/100388 

PRIOR FILING DATE: 1998-09-15 

PRIOR APPLICATION NUMBER: 60/100390 

PRIOR FILING DATE: 1998-09-15 

PRIOR APPLICATION NUMBER: 60/100584 



PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100627 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100661 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100662 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100664 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100683 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100684 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100710 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100711 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100848 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100849 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100919 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100930 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/101014 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101068 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101071 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101279 
PRIOR FILING DATE: 1998-09-22 
PRIOR APPLICATION NUMBER: 60/101471 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101472 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101474 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101475 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101476 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101477 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101479 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101738 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101741 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101743 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101915 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101916 
PRIOR FILING DATE: 1998-09-24 



PRIOR APPLICATION NUMBER: 60/102207 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102240 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102307 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102330 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102331 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102484 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102487 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102570 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102571 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102684 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102687 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102965 
PRIOR FILING DATE: 1998-10-02 
PRIOR APPLICATION NUMBER: 60/103258 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103314 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103315 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103328 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103395 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103396 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103401 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103449 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103633 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103678 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103679 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103711 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/104257 
PRIOR FILING DATE: 1998-10-14 
PRIOR APPLICATION NUMBER: 60/104987 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105000 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105002 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105104 



PRIOR FILING DATE: 1998-10-21 
PRIOR APPLICATION NUMBER: 60/105169 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105266 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105693 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/105694 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/105807 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/105881 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/105882 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/106023 
PRIOR FILING DATE: 1998-10-28 

Query Match 100.0%; Score 4031; DB 12; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
              |||||||||||||||||||||||||||||||||||||||||
Db        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 5 

US-10-013-907A-277 

Sequence 277, Application US/10013907A 
Publication No. US20030064925A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C34 

CURRENT APPLICATION NUMBER: US/10/013,907A
CURRENT FILING DATE: 2001-12-10 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 477
SEQ ID NO 277 
LENGTH: 761 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-013-907A-277 



Query Match 100.0%; Score 4031; DB 12; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
              |||||||||||||||||||||||||||||||||||||||||
Db        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761

RESULT 6 

US-10-015-499A-277 

; Sequence 277, Application US/10015499A 

; Publication No. US20030065142A1 

; GENERAL INFORMATION: 

; APPLICANT: Baker, Kevin P. 

; APPLICANT: Botstein, David 



APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F.

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C42 

CURRENT APPLICATION NUMBER: US/10/015,499A
CURRENT FILING DATE: 2001-12-11 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 477
SEQ ID NO 277 
LENGTH: 761
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-015-499A-277 

Query Match 100.0%; Score 4031; DB 12; Length 761;

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
              |||||||||||||||||||||||||||||||||||||||||
Db        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761





RESULT 7 

US-10-013-910A-277 

Sequence 277, Application US/10013910A 
Publication No. US20030187192A1
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C33 

CURRENT APPLICATION NUMBER: US/10/013,910A
CURRENT FILING DATE: 2001-12-10 
PRIOR APPLICATION NUMBER: 60/098716 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098723 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098749 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098750 
PRIOR FILING DATE: 1998-09-01 



; PRIOR APPLICATION NUMBER: 60/098803 

; PRIOR FILING DATE: 1998-09-02 

; PRIOR APPLICATION NUMBER: 60/098821 

; PRIOR FILING DATE: 1998-09-02 

; PRIOR APPLICATION NUMBER: 60/098843 

; PRIOR FILING DATE: 1998-09-02 

; PRIOR APPLICATION NUMBER: 60/099536 

; PRIOR FILING DATE: 1998-09-09 

; PRIOR APPLICATION NUMBER: 60/099596 

; PRIOR FILING DATE: 1998-09-09 

; PRIOR APPLICATION NUMBER: 60/099598 

; PRIOR FILING DATE: 1998-09-09 

; Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS: 477
; SEQ ID NO 277 
LENGTH: 761 
; TYPE: PRT

; ORGANISM: Homo sapiens 
US-10-013-910A-277 

Query Match 100.0%; Score 4031; DB 12; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
              |||||||||||||||||||||||||||||||||||||||||
Db        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 8 

US-10-226-254A-277 

Sequence 277, Application US/10226254A 
Publication No. US20030224478A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C68 

CURRENT APPLICATION NUMBER: US/10/226,254A
CURRENT FILING DATE: 2002-08-21 
PRIOR APPLICATION NUMBER: 60/098716 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098723 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098749 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098750 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098803 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098821 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098843 



; PRIOR FILING DATE: 1998-09-02

; PRIOR APPLICATION NUMBER: 60/099536 

; PRIOR FILING DATE: 1998-09-09 

; PRIOR APPLICATION NUMBER: 60/099596 

; PRIOR FILING DATE: 1998-09-09 

; PRIOR APPLICATION NUMBER: 60/099598 

; PRIOR FILING DATE: 1998-09-09 

Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS: 477
; SEQ ID NO 277 
LENGTH: 761 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-226-254A-277 



Query Match 100.0%; Score 4031; DB 12; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
              |||||||||||||||||||||||||||||||||||||||||
Db        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 9 

US-10-015-395A-277 

Sequence 277, Application US/10015395A 
Publication No. US20040073015A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C57 

CURRENT APPLICATION NUMBER: US/10/015,395A
CURRENT FILING DATE: 2001-12-12 

Prior application removed - See file Wrapper or Palm 
NUMBER OF SEQ ID NOS: 477
SEQ ID NO 277 
LENGTH: 761 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-015-395A-277 



Query Match 100.0%; Score 4031; DB 12; Length 761;

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
              |||||||||||||||||||||||||||||||||||||||||
Db        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 10 
US-10-042-431-19 

; Sequence 19, Application US/10042431 

; Publication No. US20020182675A1 

; GENERAL INFORMATION: 

; APPLICANT: MCCARTHY, Sean A 

; APPLICANT: BARNES, Thomas M 

; APPLICANT: FRASER, Christopher C 

; APPLICANT: SHARP, John D 

; TITLE OF INVENTION: NOVEL GENES ENCODING PROTEINS HAVING DIAGNOSTIC, 



; TITLE OF INVENTION: PREVENTIVE, THERAPEUTIC, AND OTHER USES 
; FILE REFERENCE: 10147-6U2 

; CURRENT APPLICATION NUMBER: US/10/042,431 

; CURRENT FILING DATE: 2001-10-25 

; PRIOR APPLICATION NUMBER: US 09/333,159 

; PRIOR FILING DATE: 1999-06-14 

; PRIOR APPLICATION NUMBER: US 09/578,063 

; PRIOR FILING DATE: 2000-05-24 

; NUMBER OF SEQ ID NOS: 79

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 19 

LENGTH: 761 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-042-431-19 



Query Match 100.0%; Score 4031; DB 13; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
              |||||||||||||||||||||||||||||||||||||||||
Db        721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 11 

US-10-006-856A-277 

Sequence 277, Application US/10006856A 
Publication No. US20030044841A1
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C14 

CURRENT APPLICATION NUMBER: US/10/006,856A
CURRENT FILING DATE: 2002-05-10
NUMBER OF SEQ ID NOS: 477

Prior Application removed - See File Wrapper or Palm 
SEQ ID NO 277 
LENGTH: 761 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-006-856A-277 



Query Match 100.0%; Score 4031; DB 14; Length 761;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761





RESULT 12 

US-10-006-818A-277 

Sequence 277, Application US/10006818A
Publication No. US20030054406A1
GENERAL INFORMATION:
APPLICANT: Baker, Kevin P.
APPLICANT: Botstein, David
APPLICANT: Desnoyers, Luc
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone
APPLICANT: Fong, Sherman
APPLICANT: Gao, Wei-Qiang
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Grimaldi, Christopher J.
APPLICANT: Gurney, Austin L.
APPLICANT: Hillan, Kenneth J.
APPLICANT: Pan, James
APPLICANT: Paoni, Nicholas F.

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C4 

CURRENT APPLICATION NUMBER: US/10/006,818A
CURRENT FILING DATE: 2001-12-06 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 477
SEQ ID NO 277 
LENGTH: 761 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-006-818A-277 

Query Match 100.0%; Score 4031; DB 14; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 13 

US-10-015-393A-277 

Sequence 277, Application US/10015393A 
Publication No. US20030069179A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C46 

CURRENT APPLICATION NUMBER: US/10/015,393A
CURRENT FILING DATE: 2002-06-10 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 477
SEQ ID NO 277 
LENGTH: 761 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-015-393A-277 



Query Match 100.0%; Score 4031; DB 14; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 



Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 14 

US-10-015-869A-277 



Sequence 277, Application US/10015869A 
Publication No. US20030073130A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C45 

CURRENT APPLICATION NUMBER: US/10/015,869A
CURRENT FILING DATE: 2002-06-25 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 477
SEQ ID NO 277
LENGTH: 761
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-015-869A-277 

Query Match 100.0%; Score 4031; DB 14; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 15 

US-10-012-121A-277 

Sequence 277, Application US/10012121A 
Publication No. US20030073810A1
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L.
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C20 

CURRENT APPLICATION NUMBER: US/10/012,121A
CURRENT FILING DATE: 2001-12-07 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 477 
SEQ ID NO 277 



LENGTH: 761 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-012-121A-277 

Query Match 100.0%; Score 4031; DB 14; Length 761; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



Search completed: May 5, 2004, 17:32:25 
Job time : 51 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: May 5, 2004, 17:26:32 



Search time 52 Seconds 
(without alignments) 
4617.488 Million cell updates/sec 



Title:          US-10-015-391A-277
Perfect score:  4031
Sequence:       1 MALPALGLDPWSLLGLFLFQ..........CRTSASDVDADNNCLGTEVA 761



Scoring table: BLOSUM62 

                Gapop 10.0, Gapext 0.5

Searched: 1017041 seqs, 315518202 residues 
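
The throughput figure reported above appears to follow from the other numbers in this header: one cell update per pairing of a query residue with a database residue, divided by the search time. The short Python sketch below checks that arithmetic. It assumes this is how the rate is derived; the function name is illustrative and not part of GenCore.

```python
def cell_updates_per_sec_millions(query_len: int, db_residues: int, seconds: float) -> float:
    """Dynamic-programming cells filled per second, in millions.

    Assumption: every query residue is compared against every database
    residue exactly once (one cell update per residue pair).
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return query_len * db_residues / seconds / 1e6


# Values taken from this report: 761-residue query, 315518202 database
# residues, search time of 52 seconds.
rate = cell_updates_per_sec_millions(761, 315_518_202, 52)
print(f"{rate:.3f} Million cell updates/sec")  # prints 4617.488 Million cell updates/sec
```

The result agrees with the reported 4617.488 Million cell updates/sec, which suggests the rate is based on the 52-second search time rather than the total job time.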

Total number of hits satisfying chosen parameters: 1017041

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database : SPTREMBL_25:*
            1: sp_archea:*
            2: sp_bacteria:*
            3: sp_fungi:*
            4: sp_human:*
            5: sp_invertebrate:*
            6: sp_mammal:*
            7: sp_mhc:*
            8: sp_organelle:*
            9: sp_phage:*
           10: sp_plant:*
           11: sp_rodent:*
           12: sp_virus:*
           13: sp_vertebrate:*
           14: sp_unclassified:*
           15: sp_rvirus:*
           16: sp_bacteriap:*
           17: sp_archeap:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result          Query
   No.   Score  Match  Length  DB  ID      Description

     1    1743   43.2     328   4  Q9HA40  Q9ha40 homo sapien
     2  1239.5   30.7     823  11  Q8BIR6  Q8bir6 mus musculu
     3    1097   27.2     963   4  Q9C0C4  Q9c0c4 homo sapien
     4  1067.5   26.5     550  11  Q8BIC3  Q8bic3 mus musculu
     5    1041   25.8     862   4  Q7Z5S4  Q7z5s4 homo sapien
     6  1034.5   25.7     681   4  Q7Z3N1  Q7z3n1 homo sapien
     7     995   24.7     799  11  Q8BJC1  Q8bjc1 mus musculu
     8     882   21.9     722  11  Q810B2  Q810b2 mus musculu
     9   854.5   21.2     756  13  Q8QGU9  Q8qgu9 gallus gall
    10     847   21.0     774  13  Q8JIW9  Q8jiw9 xenopus lae
    11   817.5   20.3     748   4  Q8TB71  Q8tb71 homo sapien
    12     788   19.5     111  11  Q8BMF6  Q8bmf6 mus musculu
    13     787   19.5     111  11  Q8BH34  Q8bh34 mus musculu
    14     737   18.3     782   4  Q9NS98  Q9ns98 homo sapien
    15     736   18.3     635   4  Q96GX0  Q96gx0 homo sapien
    16   709.5   17.6     lib  11  Q9QX23  Q9qx23 mus musculu
    17     709   17.6     645   4  Q7Z5X0  Q7z5x0 homo sapien
    18     699   17.3    1122  11  Q7TT33  Q7tt33 mus musculu
    19   688.5   17.1     844  11  Q8BXU8  Q8bxu8 mus musculu
    20   678.5   16.8     587  11  Q8BUT0  Q8but0 mus musculu
    21   678.5   16.8     605  11  Q8BXZ7  Q8bxz7 mus musculu
    22   672.5   16.7    1073   4  Q8NFY4  Q8nfy4 homo sapien
    23     668   16.6     687   4  Q9BXR8  Q9bxr8 homo sapien
    24     652   16.2    1017   4  Q8NFY5  Q8nfy5 homo sapien
    25     650   16.1    1022   4  Q9P249  Q9p249 homo sapien
    26   649.5   16.1     998   4  Q8NFY6  Q8nfy6 homo sapien
    27   649.5   16.1    1009  11  Q80TD0  Q80td0 mus musculu
    28     643   16.0    1011   4  Q8NFY3  Q8nfy3 homo sapien
    29   642.5   15.9    1005  11  Q9EQ71  Q9eq71 mus musculu
    30     635   15.8     595  11  Q8BKG8  Q8bkg8 mus musculu
    31   623.5   15.5     452  13  Q7T165  Q7t165 brachydanio
    32   615.5   15.3     963  11  Q91Y36  Q91y36 mus musculu
    33   602.5   14.9     697   5  Q8MLF1  Q8mlf1 drosophila
    34   602.5   14.9     724   5  Q9V7Q7  Q9v7q7 drosophila
    35     583   14.5     770   5  O44253  O44253 drosophila
    36     581   14.4     770   5  Q9V3M4  Q9v3m4 drosophila
    37     556   13.8     418   4  Q96JU9  Q96ju9 homo sapien
    38     556   13.8     457   4  Q9HBR1  Q9hbr1 homo sapien
    39     554   13.7     403  11  Q8BKQ6  Q8bkq6 mus musculu
    40     551   13.7     616   5  Q9V7P8  Q9v7p8 drosophila
    41     542   13.4     476   4  Q8NFY7  Q8nfy7 homo sapien
    42   523.5   13.0     923  11  Q8R4U3  Q8r4u3 mus musculu
    43   515.5   12.8     920  11  Q8R4U4  Q8r4u4 rattus norv
    44   496.5   12.3     367   4  Q9HAH9  Q9hah9 homo sapien
    45   494.5   12.3    1091   5  Q7YU67  Q7yu67 drosophila
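
The Query Match column in the summaries is consistent with each hit's score expressed as a percentage of the perfect score (4031, the query aligned against itself). The following Python sketch reproduces a few rows under that assumption. The function name is illustrative, and rounding to one decimal place is inferred from the listing rather than documented here.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect (self-match) score, to one decimal."""
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return round(100.0 * score / perfect_score, 1)


PERFECT_SCORE = 4031  # "Perfect score" reported in the search header

# (result number, score) pairs copied from the summary listing
rows = [(1, 1743.0), (2, 1239.5), (3, 1097.0), (45, 494.5)]
for result_no, score in rows:
    print(result_no, query_match_percent(score, PERFECT_SCORE))
```

For these four rows the computed values are 43.2, 30.7, 27.2 and 12.3, matching the listed Query Match figures.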



ALIGNMENTS 



RESULT 1 
Q9HA40 

ID Q9HA40 PRELIMINARY; PRT; 328 AA.
AC Q9HA40;
DT 01-MAR-2001 (TrEMBLrel. 16, Created)
DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Hypothetical protein FLJ12287.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Mammary gland;
RA Isogai T., Ota T., Hayashi K., Sugiyama T., Otsuki T., Suzuki Y.,
RA Nishikawa T., Nagai K., Sugano S., Shiratori A., Sudo H.,
RA Wagatsuma M., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M.,
RA Takahashi M., Chiba Y., Ishida S., Murakawa K., Ono Y., Takiguchi S.,
RA Watanabe S., Kimura K., Murakami K., Ishii S., Kawai Y., Saito K.,
RA Yamamoto J., Wakamatsu A., Nakamura Y., Nagahari K., Masuho Y.,
RA Ninomiya K., Iwayanagi T.;
RT "NEDO human cDNA sequencing project.";
RL Submitted (AUG-2000) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AK022349; BAB14018.1;
DR GO; GO:0005622; C:intracellular; IEA.
DR GO; GO:0016020; C:membrane; IEA.
DR GO; GO:0005524; F:ATP binding; IEA.
DR GO; GO:0003684; F:damaged DNA binding; IEA.
DR GO; GO:0008094; F:DNA dependent ATPase activity; IEA.
DR GO; GO:0004872; F:receptor activity; IEA.
DR GO; GO:0003700; F:transcription factor activity; IEA.
DR GO; GO:0007275; P:development; IEA.
DR GO; GO:0006310; P:DNA recombination; IEA.
DR GO; GO:0006281; P:DNA repair; IEA.
DR GO; GO:0006355; P:regulation of transcription, DNA-dependent; IEA.
DR InterPro; IPR000005; HTHAraC.
DR InterPro; IPR003659; Plexin-like.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR001553; RecA.
DR InterPro; IPR001627; Sema.
DR Pfam; PF01437; PSI; 1.
DR Pfam; PF01403; Sema; 1.
DR PRINTS; PR00142; RECA.
DR SMART; SM00423; PSI; 1.
DR PROSITE; PS00041; HTH_ARAC_FAMILY_1; 1.
KW Hypothetical protein.
SQ SEQUENCE 328 AA; 35403 MW; 6F455687BFCB2312 CRC64;

Query Match 43.2%; Score 1743; DB 4; Length 328;
Best Local Similarity 100.0%; Pred. No. 1.8e-141;
Matches 328; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy  434 MYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVP 493
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVP 60

Qy  494 RANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMERGNPEWACASGPMS 553
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 RANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMERGNPEWACASGPMS 120

Qy  554 RSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASSTVYNGSLLLI 613
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 RSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASSTVYNGSLLLI 180

Qy  614 VQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREHVKVPLTRVSGGAA 673
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 VQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREHVKVPLTRVSGGAA 240

Qy  674 LAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCETLRPGEKAPLSRE 733
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 LAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCETLRPGEKAPLSRE 300

Qy  734 QHLQSPKECRTSASDVDADNNCLGTEVA 761
        ||||||||||||||||||||||||||||
Db  301 QHLQSPKECRTSASDVDADNNCLGTEVA 328
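
The three statistics lines that precede each alignment can be cross-checked against one another. The sketch below is a minimal illustration, not part of the search program: the function names are hypothetical, and the two formulas are assumptions inferred from the figures printed in this listing (Best Local Similarity appears to equal Matches divided by the number of alignment columns, and Query Match appears to equal Score divided by 4031, the score of the query against itself in the 100.0% hit).

```python
QUERY_SELF_SCORE = 4031  # score reported for the 100.0% query match in this listing


def parse_stats(block):
    """Split 'Name value;' fields from the statistics lines into a dict of floats."""
    stats = {}
    for field in block.replace("\n", " ").split(";"):
        parts = field.split()
        if len(parts) < 2:
            continue  # skip empty fragments left by trailing semicolons
        name = " ".join(parts[:-1])
        stats[name] = float(parts[-1].rstrip("%"))
    return stats


def best_local_similarity(stats):
    """Assumed formula: identical positions over all alignment columns, in percent."""
    columns = (stats["Matches"] + stats["Conservative"]
               + stats["Mismatches"] + stats["Indels"])
    return round(100.0 * stats["Matches"] / columns, 1)


def query_match(stats, self_score=QUERY_SELF_SCORE):
    """Assumed formula: hit score over the query's self-score, in percent."""
    return round(100.0 * stats["Score"] / self_score, 1)


if __name__ == "__main__":
    result_1 = parse_stats(
        "Query Match 43.2%; Score 1743; DB 4; Length 328;\n"
        "Best Local Similarity 100.0%; Pred. No. 1.8e-141;\n"
        "Matches 328; Conservative 0; Mismatches 0; Indels 0; Gaps 0;"
    )
    print(best_local_similarity(result_1), query_match(result_1))
```

A recomputed value that disagrees with the printed percentage is a quick indicator that a digit in the statistics lines was damaged during text extraction.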



RESULT 2 
Q8BIR6 

ID Q8BIR6 PRELIMINARY; PRT; 823 AA. 

AC Q8BIR6; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Semaphorin 4B. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Ovary, and Uterus; 

RX MEDLINE=22354683; PubMed=12466851;

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002). 

DR EMBL; AK030331; BAC26906.1; 

DR GO; GO:0005622; C:intracellular; IEA.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0003700; F:transcription factor activity; IEA.

DR GO; GO:0007275; P:development; IEA.

DR GO; GO:0006355; P:regulation of transcription, DNA-dependent; IEA.

DR InterPro; IPR000005; HTHAraC.

DR InterPro; IPR003659; Plexin-like.

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS00041; HTH_ARAC_FAMILY_1; 1.

SQ SEQUENCE 823 AA; 91485 MW; E3B8C5E5CFA1CD45 CRC64; 

Query Match 30.7%; Score 1239.5; DB 11; Length 823; 

Best Local Similarity 37.6%; Pred. No. 1.7e-97; 

Matches 295; Conservative 128; Mismatches 289; Indels 73; Gaps 24;



13 LLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQDFDTLLLSGD 72 

III: I I I I III I Ih : I : I : : :: lllll 

11 LLSLMVLLLLLLLWTTTTRALG PRINVPLCCQERLIRKFEAENISNYTALLLSQD 65 

73 GNTLYVGAREAILALDIQDPGVP--RLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRV 130

I I I 1 I I I I I I : II : : I : : : I I : I : I : I I I : I I : I : : 

66 GKTLYVGAREALFALNSNLSFLPGGEYQELL-WSADADRKQQCSFKGKDPKRDCQNYIKI 124

131 LVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVM--EGKGQSPFDPAHKHTAVL 188

I : I : I I I I I I I I I I I : I : I I : : I I I : I I I I III:: 

125 LLPLNSSHLLTCGTAAFSPLCAYIHIASFTLAQDEAGNVILEDGKGRCPFDPNFKSTALV 184

189 VDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAA----------IP 238

I I I I I : I I : : : I I :: I : I : I : I I I : : I I I I : I I I : I 

185 VDGELYTGTVSSFQGNDPAISRSQSSRPT-KTESSLNWL-QDPAFVASAYVPESLGSPIG 242 

239 STQVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ 298

: I I I I II Ihlll I I I I I I I I I | | I : : | I : : I I : I I I I I I | I : : I 

243 DDDRIYFFFSETGQEFEFFENTIVSRVARVCKGDEGGERVLQQRWTSFLKAQLLCSRPDD 302 

299 -LPFNVIRHAVLL---PADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKY 354

MM:: | M I I I I I I 1 II I I : I I : : I : : : I I I 

303 GFPFNVLQDVFTLNPNPQDW-RKTLFYGVFTSQWHRGTTEGSAICVFTMNDVQKAFDGLY 361 

355 KELNKETSRWTTYRGPETNPRPGSCSVGPS-----------SDKALTFMKDHFLMDEQVV 403

I : : I : I I : I I I I I I : I : I : I I : I I I I I I I I I 

362 KKVNRETQQWYTETHQVPTPRPGACITNSARERKINSSLQLPDRVLNFLKDHFLMDGQVR 421

404 GTPLLVKSGVEYTRLAVETAQGLDGHS-HLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQL 462

II:: I I : I I II II : |::||| I Mill I |::||:|: 
422 SRLLLLQPRARYQRVAVHRVPGL--HSTYDVLFLGTGDGRLHKAVTL-SSRVHIIEELQI 478

463 FPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRT 522

II : I I : II I I : : I I : I I I I I I : I : I I I : I M I I : I I I : 

479 FPQGQPVQNLLLDSHGGLLYASSHSGVVQVPVANCSLYPTCGDCLLARDPYCAWTGSACR 538

523 CCLLSAPNLNS--WKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPC 580

I I : I I I I I : I : : I : : I I : I I I :: I I 

539 LASLYKPDLASRPWTQDIEGASVKELCKNSSYKARFLVPGKP--CKQVQIQPNTVNTLAC 596

581 PHLSALASYYWSHGPAAVPEASS--TVYNGSLLLI-VQDGVGGLYQCWATENGFSYPVIS 637

Mill: II II : : I : MM: I I M :: I I I M I I II 
597 PLLSNLATRLWVHNGAPVNASASCRVLPTGDLLLVGSQQGL-GVFQCWSIEEGFQQLVAS 655 

638 YWVDSQDQ-TLALDPELAGIPREHVKVPLTRVS----GGAALAAQQSYWPHFVTVTVLFA 692

I : :: : : II I : : I I I I I : Mill I : : II 
656 YCPEVMEEGVMDQKNQRDGTP---VIINTSRVSAPAGGRASWGADKSYWNEFLVMCTLFV 712

693 LVLSGALIILVASPLRALRARGKVQGCETLRPGEKAPLSREQHLQSPKECRTSASDVDAD 752

: : I : :: I MM : : : I I I 
713 FAMVLLFLFLLYRHRDGMKLFLKQGECASVHP KTRPIVLPPETRPL 758 

753 NNCLG 757 
I I : I 

759 -NCVG 762 
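
Each hit in this listing opens with a database entry whose lines start with a two-letter code (ID, AC, DT, DE, OS, OC, OX, RN, RA, RT, RL, DR, KW, FT, SQ). As a minimal sketch, assuming that every annotation line begins with two uppercase letters followed by a space, the entry text can be grouped by line code as shown below. The function name `parse_entry` is hypothetical and is not taken from any existing library.

```python
from collections import defaultdict


def parse_entry(text):
    """Group flat-file annotation lines by their two-letter line code.

    Lines that do not start with two uppercase letters and a space
    (alignment rows such as 'Qy ...' or 'Db ...', match lines, blanks)
    are ignored.
    """
    fields = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        if len(line) < 4 or line[2] != " ":
            continue
        code = line[:2]
        if not (code.isalpha() and code.isupper()):
            continue
        fields[code].append(line[3:].strip())
    return dict(fields)


if __name__ == "__main__":
    sample = """ID Q8BIR6 PRELIMINARY; PRT; 823 AA.
AC Q8BIR6;
DE Semaphorin 4B.
OS Mus musculus (Mouse).
OX NCBI_TaxID=10090;
DR Pfam; PF01437; PSI; 1.
DR Pfam; PF01403; Sema; 1.
Qy 13 LLGLFLFQLLQLLLPTTTAGGGG 35
"""
    entry = parse_entry(sample)
    print(entry["DE"], entry["DR"])
```

Repeated codes such as DR or RA are kept in order as a list, so multi-line author lists and cross-references can be rejoined afterwards.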



RESULT 3 
Q9C0C4 

ID Q9C0C4 PRELIMINARY; PRT; 963 AA. 

AC Q9C0C4; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Hypothetical protein KIAA1739 (Fragment).

GN KIAA1739.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21082932; PubMed=11214970;

RA Nagase T., Kikuno R., Hattori A., Kondo Y., Okumura K., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. XIX.

RT The complete sequences of 100 new cDNA clones from brain which code

RT for large proteins in vitro.";

RL DNA Res. 7:347-355(2000).

DR EMBL; AB051526; BAB21830.1; -.

DR Genew; HGNC:10731; SEMA4C.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0007275; P:development; IEA.

DR InterPro; IPR003599; Ig.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003659; Plexin-like.

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Hypothetical protein. 

FT NON_TER 1 1

SQ SEQUENCE 963 AA; 106735 MW; 3FA37DBBA3483ECE CRC64;



Query Match 27.2%; Score 1097; DB 4; Length 963;

Best Local Similarity 36.6%; Pred. No. 4e-85;

Matches 291; Conservative 101; Mismatches 288; Indels 116; Gaps 28;

Qy 6 LGLDPWSLLGLFLFQLLQL LLPTTTAGGGG QGPMPRVRYYAGD 48

I I I I I II I M Ml Ml

Db 107 LGRPRWGCQGQRLFQKCPLLPIRGFGWHLLVAWGAGSRGARLRAVEPQGSCPSAAMLTPA 166

Qy 49 E RRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPW 103

I II I I hill II I: 11111111:1 : : I : I I

Db 167 ELATWRR FSQTGIQDFLTLTLTEPTGLLYVGAREALFAFSME ALELQGAISW 219

Qy 104 PASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLP 163

I Ihll I I : I : I : I I I I I I I I I : I I I I I I : I I I I I : : :

Db 220 EAPVEKKTECIQKGKNNQTECFNFIRFLQPYNASHLYVCGTYAFQPKCTYVNM---LTFT 276



Qy 164 ISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNF 223 

: : : | | | : | : I | | I : I I I I I I I I : I I I I I : II I : : I : I : I I : 
Db 277 LEHGEFEDGKGKCPYDPAKGHAGLLVDGELYSATLNNFLGTEPIILRNMGPHHSMKTEYL 336 

Qy 224 LRWLHHDASFV--AAIPST--------QVVYFFFEETASEFDFFERLHTSRVARVCKNDV 273

||:: II I : I : MINIMI: : I I I I I I I I : 

Db 337 AFWL-NEPHFVGSAYVPESVGSFTGDDDKVYFFFRERAVESDCYAEQVVARVARVCKGDM 395

Qy 274 GGEKLLQKKWTTFLKAQLLCTQPG-QLPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGG 332 

I I : I I : I I I I I I I I : I I : I I I I I : : I I : I I : I I I 

Db 396 GGARTLQRKWTTFLKARLACSAPNWQLYFNQLQAMHTLQDTSWHNTTFFGVFQAQW--GD 453

Qy 333 TRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNPRPGSC-----------SV 381

I I : I : I : I : I I I : I I I I : : I : I I I : I I I I I I I 
Db 454 MYLSAICEYQLEEIQRVFEGPYKEYHEEAQKWDRYTDPVPSPRPGSCINNWHRRHGYTSS 513 

Qy 382 GPSSDKALTFMKDHFLMDEQV---VGTPLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGT 438

I 11:1111:111 I I I I I I : I I : I I I I : : I : : : I I 

Db 514 LELPDNILNFVKKHPLMEEQVGPRWSRPLLVKKGTNFTHLVADRVTGLDGATYTVLFIGT 573 

Qy 439 TTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCS 498

I I III I I I : I I : I I I I I I : I : I I : : : : I I : : : I I : I 

Db 574 GDGWLLKAVSLG-PWVHLIEELQLF-DQEPMRSLVLSQSKKLLFAGSRSQLVQLPVADCM 631 

Qy 499 VYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMERGNPEWAC-ASGPMSRSLR 557

I I I I I I I I I I I : I I I : I : : : I : : I I I : : I 

Db 632 KYRSCADCVLARDPYCAWSVNTSRCVAVGGHSGSLLIQHVMTSDTSGICNLRG--SKKVR 689

Qy 558 PQSRPQIIKEVLAVPNSILELPCPHLSA-LASYYWSHGPAAVP--EASSTVYNGSLLLIV 614

I I : I : I I I I I I I : I I I : I : I : I : I : I : I 

Db 690 PTP KNITWAGTDLVLPC-HLSSNLAHARWTFGGRDLPAEQPGSFLYDARLQALV 743 

Qy 615 QDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREHVKVPLTRVSG 670 

I I I I : : I I II: I * I I : 
Db 744 VMAAQPRHAGAYHCFSEEQG ARLAAEGYLVAV VAGPSVTLEA 785 

Qy 671 GAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCETLRPGEKA-- 728

II I : | | | : I I :: I I I I II Mill 

Db 786 RAPLENLGLVW LAWALGAVCL — VLLLLVLS LRRRLR EELEKGAKATE 832 

Qy 729 PLSREQHLQSP 739 

I I : I I 
Db 833 RTLVYPLELPKEPTSP 848 



RESULT 4 
Q8BIC3 

ID Q8BIC3 PRELIMINARY; PRT; 550 AA. 

AC Q8BIC3; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Semaphorin 4B. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;



RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Skin; 

RX MEDLINE=22354683; PubMed=12466851;

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs."; 

RL Nature 420:563-573(2002). 

DR EMBL; AK028896; BAC26181.1; -. 

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0007275; P:development; IEA.

DR InterPro; IPR003659; Plexin-like.

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

SQ SEQUENCE 550 AA; 61309 MW; 8C28E29DC4CAC958 CRC64;

Query Match 26.5%; Score 1067.5; DB 11; Length 550; 

Best Local Similarity 42.7%; Pred. No. 6.1e-83; 

Matches 233; Conservative 87; Mismatches 185; Indels 41; Gaps 14; 

Qy 18 LFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQDFDTLLLSGDGNTLY 77 

I I I I I I I I I I II: III: I : : : : I I II I I I I I 

Db 15 LLLLLLLLLRTTTTRALG----PRISVPLGSEERLIRKFEAENISNYTALLLSQDGKTLY 70

Qy 78 VGAREAILALDIQDPGVP--RLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYN 135

I || | | | : | I : : I : : : I I : I : I : I I I : I I : I : : I : I 

Db 71 VGAREALFALNSNLSFLPGGEYQELL-WSADADRKQQCSFKGKDPKRDCQNYIKILLPLN 129

Qy 136 VTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVM--EGKGQSPFDPAHKHTAVLVDGML 193

: | | | | | | I I I I I : I : I I : : I I I : I I i I I I I :: I I I I 

Db 130 SSHLLTCGTAAFSPLCAYIHIASFTLAQDEAGNVILEDGKGRCPFDPNFKSTALVVDGEL 189

Qy 194 YSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAA----------IPSTQVV 243

I : I I : : : I i : : I : I : I : I I I : : I I I I : I I I : I = 

Db 190 YTGTVSSFQGNDPAISRSQSSRPT-KTESSLNWL-QDPAFVASAYVPESLGSPIGDDDKI 247

Qy 244 YFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ-LPFN 302 

M I I I I Ihlll I I I I I I I I I I I I :: I I I I : I I I I I I I I I IN 
Db 248 YFFFSETGQEFEFFENTIVSRVARVCKGDEGGERVLQQRWTSFLKAQLLCSRPDDGFPFN 307 

Qy 303 VIRHAVLL---PADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNK 359

I : : I II I I I I I I I II I I : I I : : I : : : I I I I : : I : 

Db 308 VLQDVFTLNPNPQDW-RKTLFYGVFTSQWHRGTTEGSAICVFTMNDVQKAFDGLYKKVNR 366

Qy 360 ETSRWTTYRGPETNPRPGSCSVGPS-----------SDKALTFMKDHFLMDEQVVGTPLL 408

| | : I I II II : I : I : I I : I I I I I I I I I I I 

Db 367 ETQQWYTETHQVPTPRPGACITNSARERKINSSLQLPDRVLNFLKDHFLMDGQVRSRLLL 426 

Qy 409 VKSGVEYTRLAVETAQGLDGHS-HLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPE 467

: : Ihll II I I : I : : I I I I I I I I I I I : : I I : I : I I : 
Db 427 LQPRARYQRVAVHRVPGL--HSTYDVLFLGTGDGRLHKAVTL-SSRVHIIEELQIFPQGQ 483



Qy 468 PVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLS 527

I I : I I I I : : I I : I I I II I : I : I I I : I I I I I : I I I : I 

Db 484 PVQNLLLDSHGGLLYASSHSGVVQVPVANCSLYPTCGDCLLARDPYCAWTGSACRLASLY 543

Qy 528 APNLNS 533 

I : I I 

Db 544 QPDLAS 549 



RESULT 5 
Q7Z5S4 

ID Q7Z5S4 PRELIMINARY; PRT; 862 AA.

AC Q7Z5S4; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE SEMA4D protein. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Eye; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,

RA Jones S.J., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Eye; 

RA Strausberg R.;

RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC054500; AAH54500.1; -. 

SQ SEQUENCE 862 AA; 96207 MW; 54C8EEB73B78938E CRC64; 

Query Match 25.8%; Score 1041; DB 4; Length 862; 

Best Local Similarity 35.2%; Pred. No. 2.3e-80; 

Matches 271; Conservative 115; Mismatches 263; Indels 120; Gaps 27;



Qy 38 PMPRVRYYAGDERRALSF--FHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVP 95

I : I I : : | | : I I : : : : I I I I I : I I I : I I I I I : I s : : = 
Db 26 PIPRITW EH REVH LVQ FH E P D I YN Y S AL LL S E D KDT L Y I GARE AVFAVNALN — IS 79 

Qy 96 RLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIE 155 

: : : I I : I I : : I I I I i = I : I I : I I I I : I I I I I I I I I I I : 

Db 80 EKQHEVYWKVSEDKKAKCAEKGKSKQTECLNYIRVLQPLSATSLYVCGTNAFQPACDHLN 139

Qy 156 LQDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQ 215

I I : I I I I I : I I I I I I : I : I : I I I 1 I I I I M I i I I i I : I 
Db 140 LTSFKFLGKNED----GKGRCPFDPAHSYTSVMVDGELYSGTSYNFLGSEPIISRNSSHS 195

Qy 216 PVLKTDNFLRWLHHDAS FVAA IPST QWYFFFEETASEFDFFERLHTSRV 265 

I I : I : : I I : : I I I I I : I I I I I I : I : : I I : I : 

Db 196 P-LRTEYAIPWL-NEPSFVFADVIRKSPDSPDGEDDRVYFFFTEVSVEYEFVFRVLIPRI 253

Qy 266 ARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ-LPFNVIRHAVLLPADSPTAPHIYAVF 324 

I I I I I I I I : I I I I I I : I t I I : I : I : : I I I I I : I : I : I I h I 

Db 254 ARVCKGDQGGLRTLQKKWTSFLKARLICSRPDSGLVFNVLRDVFVLRSPGLKVPVFYALF 313 

Qy 325 TSQWQVGGTRSSAVCAFSLLDIERVFK-GKYKE---LNKETSRWTTYRGPETNPRPGSCS 380

I I : I I I I I :: I III III: : * : : I Mi I I I I : I 

Db 314 TP--QLNNVGLSAVCAYNLSTAEEVFSHGKYMQSTTVEQSHTKWVRYNGPVPKPRPGACI 371

Qy 381 VGPS-----------SDKALTFMKDHFLMDEQVV---GTPLLVKSGVEYTRLAVETAQGL 426

: I I I I : I II I I I : I I hi I I I : : I : II 

Db 372 DSEARAANYTSSLNLPDKTLQFVKDHPLMDDSVTPIDNRPRLIKKDVNYTQIVVDRTQAL 431

Qy 427 DGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQG--AVFVG 484

|| : I I : : I I : I I I I : I : : I : : I I I I I I I I I : I I ' : I hi 

Db 432 DGTVYDVMFVSTDRGALHKA-ISLEHAVHIIEETQLFQDFEPVQTLLLSSKKGNRFVYAG 490 

Qy 485 FSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMERGNPE 544 

: I I : I I I : : I II I I I I I h I I I I : I I I I I 

Db 491 SNSGVVQAPLAFCGKHGTCEDCVLARDPYCAWSPPTATCVALHQTESPSRGLIQEMSGDA 550

Qy 545 WACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASST 604 

| | : : I I I I I I : I I : I I 

Db 551 SVC PDKSKGSYRQHFFKHGGTAELKCSQKSNLARVFWKF-QNDVLKAESP 599 

Qy 605 VY NGSLLLIVQDGVGGLYQCWATE NGFSYPVISYWVDSQDQTLALDPELAGI 65 6 

I I: : : I h I I I : I I : h : I h : 

Db 600 KYGLMGRKNLLIFNLSEGDSGVYQCLSEERVKNKTVFQWAKHV LEVKW 649 

Qy 657 P REH VKVP LT RV- S GGAA1AAQQ S YW P H FVT VT VLFALVL S GAL 1 1 LVAS PLRA 709 

| : | I : I : I : : I : : I I I I II 

Db 650 PKPWAPTLSWQTEGSRIATK VLVASTQGSSPPTPA 686 

Qy 710 LRARGKVQGCETLRPGEKAP LSREQHLQSPKECRTSASD 748 

: : I I I I I : I I : = I I I Ml 

Db 687 VQATS — SGAITL-PPKPAPTGTSCEPKIVINTVPQLHSEKTMYLKSSD 732 



RESULT 6 
Q7Z3N1 

ID Q7Z3N1 PRELIMINARY; PRT; 681 AA. 



AC Q7Z3N1; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Hypothetical protein DKFZp686A04130.

GN DKFZP686A04130.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Human uterus; 

RA Lauber J., Bahr A., Mewes H.W., Weil B., Amid C., Osanger A., Fobo G.,

RA Han M., Wiemann S.;

RL Submitted (JUN-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BX537671; CAD97806.1; 

KW Hypothetical protein. 

SQ SEQUENCE 681 AA; 75401 MW; 9C4DAF83918BCDDA CRC64; 

Query Match 25.7%; Score 1034.5; DB 4; Length 681; 

Best Local Similarity 38.5%; Pred. No. 5.8e-80; 

Matches 243; Conservative 108; Mismatches 223; Indels 57; Gaps 21; 

Qy 151 CTFIELQDSYLLPISEDKVM--EGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPIL 208

I I : I : : : I : I : : I I I : I I I I I I I : : I I I I I : I I : : : I I : : I : 
Db 2 CTYINMENFTLARDEKGNVLLEDGKGRCPFDPNFKSTALVVDGELYTGTVSSFQGNDPAI 61

Qy 209 MRTLGSQPVLKTDNFLRWLHHDASFVAA — IPST QWYFFFEETASEFDFFE 258 

I : : I I I : : I I I I : I I I : I I : I I I I I I I I : I I I 

Db 62 SRSQSLRPT-KTESSLNWL-QDPAFVASAYIPESLGSLQGDDDKTYFFFSETGQEFEFFE 119 

Qy 259 RLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ-LPFNVIRHAVLLPADSPTA 317

I I : I I : i I I I I I : : I I : : I i : I I I I I I I I : : I III!:: | I I : 
Db 120 NTIVSRIARICKGDEGGERVLQQRWTSFLKAQLLCSRPDDGFPFNVLQDVFTL---SPSP 176

Qy 318 PH IYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPET 372 

I I I I I I I II I I I I I : : I : : I I I I I I I : I : II : I I I 
Db 177 QDWRDTLFYGVFTSQWHRGTTEGSAVCVFTMKDVQRVFSGLYKEVNRETQQWYTVTHPVP 236 

Qy 373 NPRPGSCSVGPS-----------SDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAVE 421

I I I I : I : I : I I : I I I I II I II I I : : I I : I I 

Db 237 TPRPGACITNSARERKINSSLQLPDRVLNFLKDHFLMDGQVRSRMLLLQPQARYQRVAVH 296 

Qy 422 TAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAV 481

II I : : | : : | | | I I I II I I I : : I I : I : I : I I : I I I : I : 
Db 297 RVPGLH-HTYDVLFLGTGDGRLHKAVSVG-PRVHIIEELQIFSSGQPVQNLLLDTHRGLL 354 

Qy 482 FVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNL--NSWKQDME 539

: I I : I I I I I I : I I I I I : I I I I I : I I I I I II I I I : I 

Db 355 YAASHSGVVQVPMANCSLYRSCGDCLLARDPYCAWSGSSCKHVSLYQPQLATRPWIQDIE 414

Qy 540 RGNPEWAC-ASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAV 598 

: : I II : I I I : : : I I I : : I I I 1 I I I : I II 
Db 415 GASAKDLCSASSWSPSFVPTGE-KPCEQVQFQPNTWTLACPLLSNLATRLWLRNGAPV 473 



Qy 599 PEASS--TVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAG- 655



• • I • I III- -I -III- I II III ■ • -I • I 

Db 474 NAS AS CHVL PTGDLLLVGTQQLGE- FQCWS LEEGFQQLVAS YCP EWEDGVADQT DEGGS 532 

Qy 656 IPREHVKVPLTRVS GGAALAAQQ S YWPH FVTVTVL FALVL S GAL 1 1 LVAS P LRALR 711 

: I I : : I I I II: 1 : I I I I : : III: : : I : : : : 

Db 533 VP VI I S T S RVS AP AGGKAS WGAD RS YWKE FLVMCT L FVLAVLL PVL FL L YRH RN SMK 589 

Qy 712 ARGKVQGCETLRPGEKAPLSREQHLQSPKEC 742 

I | : : | : | : : I I I 

Db 590 VFLKQGECASVHP-KTCPV VLPPKTC 614 



RESULT 7 




Q8BJC1 




ID Q8BJC1 PRELIMINARY; PRT; 799 AA.
AC Q8BJC1;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Semaphorin 4D precursor (Fragment).
GN SEMA4D.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=NOD; TISSUE=Thymus;
RX MEDLINE=22354683; PubMed=12466851;
RA The FANTOM Consortium,
RA the RIKEN Genome Exploration Research Group Phase I & II Team;
RT "Analysis of the mouse transcriptome based on functional annotation of
RT 60,770 full-length cDNAs.";
RL Nature 420:563-573(2002).
DR EMBL; AK088653; BAC40480.1; -.
DR MGD; MGI:109244; Sema4d.
DR GO; GO:0016020; C:membrane; IEA.
DR GO; GO:0004872; F:receptor activity; IEA.
DR GO; GO:0007275; P:development; IEA.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003598; Ig_c2.
DR InterPro; IPR003659; Plexin-like.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR001627; Sema.
DR Pfam; PF00047; ig; 1.
DR Pfam; PF01437; PSI; 1.
DR Pfam; PF01403; Sema; 1.
DR SMART; SM00408; IGc2; 1.
DR SMART; SM00423; PSI; 1.
DR SMART; SM00630; Sema; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
FT NON_TER 1 1
SQ SEQUENCE 799 AA; 88814 MW; 415D3E687150A59A CRC64;





Query Match 24.7%; Score 995; DB 11; Length 799;

Best Local Similarity 40.2%; Pred. No. 1.9e-76; 

Matches 237; Conservative 80; Mismatches 212; Indels 60; Gaps 18; 



Qy 77 YVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNV 136

I I I I I I I : I : : : : : : : I I : I I I : I I I I I : I : I I : I I I I : 
Db 1 YVGAREAVFAVNALN--ISEKQHEVYWKVSEDKKSKCAEKGKSKQTECLNYIRVLQPLSR 58

Qy 137 THLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSG 196

I I I I I I I I I I : I I I I I I I I : I ! I I I I : I : I : I I I I I I 
Db 59 TSLYVCGTNAFQPTCDHLNLTSFKFLGKSED----GKGRCPFDPAHSYTSVMVGGELYSG 114

Qy 197 TMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAA IPSTQWYFF 246 

1 I I I I ! I I I : I I I : I : : I I : : i I I I I I I I 

Db 115 TSYNFLGSEPIISRNSSHSP-LRTEYAIPWL-NEPSFVFADVIQKSPDGPEGEDDKVYFF 172 

Qy 247 FEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ-LPFNVIR 305 

I I : I : : I : I I I I I I I I I I I : I I I I I I : I I i I : I : I : : I III::: 
Db 173 FTEVSVEYEFVFKLMI PRVARVCKGDQGGLRTLQKKWTS FLKARLI CSKPDSGLVFNI LQ 232 

Qy 306 HAVLL P AD S PT APH I YAVFT S QWQVGGT RS S AVCAFS LLD I ERVF- KGK YKE LNKET 361 

: I I I I I I I I I : I I I I I : : I : I I I : I I I : : : 

Db 233 DVFVLRAPGLKEP VF YAVFT P--QLNNVGLSAVCAYTLATVEAVFSRGKYMQS AT VEQSH 290 

Qy 362 SRWTTYRGPETNPRPGSCSVGPS-----------SDKALTFMKDHFLMDEQVV---GTPL 407

: : I III I I I I : I : I I I I : I I I I li : I I 

Db 291 TKWVRYNGPVPTPRPGACIDSEARAANYTSSLNLPDKTLQFVKDHPLMDDSVTPIDNRPK 350 

Qy 408 LVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPE 467

I : I I i i : : I : I I I I : I I :: I I : I I I I I : I : : I I II I I I 

Db 351 LIKKDVNYTQIVVDRTQALDGTFYDVMFISTDRGALHKAVIL-TKEVHVIEETQLFRDSE 409

Qy 468 PVRNLQLAPTQGAVFV--GFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCL 525

II II: : I II | : I | : 1 | | : I I I I I I I I I I : I I I I : I 

Db 410 PVLTLLLSSKKGRKFVYAGSNSGVVQAPLAFCEKHGSCEDCVLARDPYCAWSPAIKACVT 469

Qy 526 LSAPNLNS--WKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHL 583

I : I I I I I I I I I : III 

Db 470 LHQEEASSRGWIQDM SGDTSSCL — DKSKESFNQHFFKHGGTAELKCFQK 517 

Qy 584 SALASYYW — SHG — PAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATE 628 

III I : I III: I : : I I I : I I I : I 

Db 518 SNLARWWKFQNGELKAASPKYGFVGRKHLLIFNLSDGDSGVYQCLSEE 566 



RESULT 8 
Q810B2 

ID Q810B2 PRELIMINARY; PRT; 722 AA. 

AC Q810B2; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created)

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Semaphorin M (Fragment).

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; 



RA Jang W., Spilson S.V., Hua A., Roe B., Meisler M.H.;

RT "Large-scale comparative sequence analysis of human and mouse genomic

RT DNA reveals coding regions of three new genes.";

RL Submitted (DEC-1997) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF038652; AAD02103.1;

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0007275; P:development; IEA.

DR InterPro; IPR003659; Plexin-like.

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema.

DR Pfam; PF01437; PSI; 1.

DR Pfam; PF01403; Sema; 1.

DR SMART; SM00423; PSI; 1.

DR SMART; SM00630; Sema; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 722 AA; 78612 MW; 716627589A9DB2D4 CRC64; 

Query Match 21.9%; Score 882; DB 11; Length 722; 

Best Local Similarity 33.2%; Pred. No. 8.7e-67; 

Matches 238; Conservative 102; Mismatches 288; Indels 88; Gaps 24; 

Qy 63 DFDTLLLSGDGNTLYVGAREAILALDIQDPG-VPRLKNMIPWPASDRKKSECAFKKKSNE 121

:': I I : : I I I I I I ! : : I I I : I I I II : : I I I I 
Db 15 NYSALLVDPASHTLYVGARDSIFALTLPFSGEKPR RIDWMVPETHRQNCR-KKGKKE 70 

Qy 122 TQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDPA 181 

: I I I I : : I I : I I I I I I I I I I I I : : : : : : I : I : I I : I I 
Db 71 DECHNFIQILAIANASHLLTCGTFAFDPKCGVIDVSSFQQV ERLESGRGKCPFEPA 126 

Qy 182 HKHTAVLVDGMLYSGTMNNFLGSEPILMRTLG-SQPVLKTDNFLRWLHHDASFVAAI 237 

: 11: I : I I : I : II I I : I I I : I : I : : : : I : I I : Mill: 
Db 127 QRS AAVMAGGVL YT AT VKN FLGT E P 1 1 S RAVGRAEDWI RT ET L S S WLNAP A- FVAAMVL S 185 

Qy 238 PSTQWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQ 290 

: : II I I I : MM: I I I I I I I : I I I I I :: I I I I I I I 
Db 186 PAEWGDEDGDDEIFFFFTETSRVLDSYERIKVPRVARVCAGDLGGRKTLQQRWTTFLKAD 245 

Qy 291 LLCTQP--GQLPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIER 348

III I I : : II I I : I : I I I : I I I 1 I I I II 

Db 246 LLCPGPEHGRASGVLQDMTELRPQPGAGTPLFYGIFSSQWE — GAAISAVCAFRPQDIRA 303 

Qy 349 VFKGKYKELNKETSRWTTYRGPET-NPRPGSC SVGPS SDKALTFMKDHF 396 

I M : I I : : I I I I I I I II I M I I : : I I 

Db 304 VLNGPFRELKHDCNRGLPVMDNEVPQPRPGECITNNMKFQQFGSSLSLPDRVLTFIRDHP 363 

Qy 397 LMDEQW GTPLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSS 453 

III I Mill: II: M : Mill I II M M 

Db 364 LMDRPVFPADGRPLLVTTDTAYLRWAHRVTSLSGKEYDVLYLGTEDGHLHRAVRIG-AQ 422 

Qy 454 AHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPH 513 

: : I : : I I I : : I I : : : I Ml Ml : ! I : I M I : I I : I I 

Db 423 LSVLEDLALFPETQPVESMKL — YHDWLLVGSHTEVTQVNTSNCGRLQSCSECILAQDPV 480 

Qy 514 CAWDPESRTCCLLSAPNLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPN 573 

III I : I I I : I : I | | : : : I 

Db 481 CAW S - F RL DACVAH AGEH RGMVQ DIE S ADVS S L C P K EPGEHPWFEVPVATVG 532 



Qy 574 SILELPCPHLSALASYYWSHGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSY 633

: : I I I I I II I I I : I II I :: I I I I I I I : 

Db 533 HW- L P C S P S S AWAS C VW - HQ P S GV — T S LT P RRD GL E VWT P GAMGAYAC E CQ E GGAAR 58 8 

Qy 634 PVISYWVDSQDQTLALDPELAGIPREHVKVPLTRVSGGAALAAQQSYWPHFVTVTVLFAL 693

I : I : | : | | | III | 
Db 589 WAAY SLVWGSQRGPANRAHTW GAGLVG FFLG 621 

Qy 694 VLSGALIILVAS PLRALRARGKVQGCETLRPGEKAPLS REQHLQS PKECR 743 

I I : : I : I : I I I I I I I I : : I I : : I 

Db 622 VLAASLTLLLIGRRQQRRRQRELLARDKVGLDLGAPPSGTTSYSQDPPSPSPEDER 677 



RESULT 9 
Q8QGU9 



ID Q8QGU9 PRELIMINARY; PRT; 756 AA. 

AC Q8QGU9; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Semaphorin 3F. 

GN SEMA3F. 

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Watanabe Y., Nakamura H.; 

RT "Axon guidance of the trochlear nerve by Sema 3F along mid-hindbrain 

RT boundary."; 

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AB072930; BAB88691.1; -.

DR GO; GO:0007275; P:development; IEA.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003598; Ig_c2.

DR InterPro; IPR003659; Plexin-like.

DR InterPro; IPR001627; Sema.

DR Pfam; PF00047; ig; 1.

DR Pfam; PF01403; Sema; 1.

DR SMART; SM00408; IGc2; 1.

DR SMART; SM00423; PSI; 1.

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Immunoglobulin domain. 

SQ SEQUENCE 756 AA; 85196 MW; C36754C02541ED88 CRC64; 



Query Match 21.2%; Score 854.5; DB 13; Length 756; 

Best Local Similarity 30.7%; Pred. No. 2.2e-64; 

Matches 245; Conservative 123; Mismatches 278; Indels 151; Gaps 34; 

Qy 32 GGGGQGPMPRVRYYAGDERRALS FFHQKGL QDFDTLLLSGDGNTLYVG 79 

I I MM: Ml I MM I : : I II 

Db 22 GKDGVPPTPRVQ LSFKELKATGTAHFFNFLLNSSDYRILLKDEDHDRMYVG 72 



Qy 80 AREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHL 139

: : : : I : I I : I : I : I I I I I : : II I : : : I I I I I : : : I III 
Db 73 SKDYVLSLDLHD--INREPLIIHWPASQQRIEECILSGKNSNGECGNFIRLIQPWNRTHL 130

Qy 14 0 YT CGT FAFS PACT FI ELQDS YLLP I SEDKVMEGKGQS PFDPAHKHTAVLVDGMLYS 195 

I I I I I : : I I I I : I I I : : I I : III: : I I : I : : I I : 

Db 131 YVCGTGAYNPICAFINRGRKAQD-YIFYLEPDKLESGKGKCSYDPKVDTVSALINEELYA 189 

Qy 196 GTMNNFLGSEPILMRTLGSQPVLKTDNF-LRWLHHDASFVAA — IPST QWYFFF 247 

I :|:|:: : l|:| I ::|| : III • I : I I I II : :|||| 
Db 190 GVYIDFMGTDAAIFRTMGKQTAMRTDQYNSRWL-NDPAFVRAQLIPDSSERNDDKLYFFF 248 

Qy 248 EETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLPFNV 303 

I : : : :: I I : I : I I I I I I I I : I I I I I : I : I : M : I : 

Db 249 REKSADAPLSPGVY-SRIGRICLNDDGGHCCLVNKWSTFLKARLVCSVPGPDGIETHFDE 307 

Qy 304 IRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSR 363 

: : : I INN:: | : I I I I : I : I I I I : : 

Db 308 LQDVFIQQTQDTKNPVIYAVFSASGSV--FKGSAVCVYSMADIRMVFNRPFAHKEGPNYQ 365 

Qy 364 WTTYRGPETNPRPGSCSVG PS S DKALT FMKDHFLMDEQWGT PLLVK 410 

III Nihil II I : : I I : I I I I I I : I : 

Db 366 WMPYTGKMPYPRPGTCPGGTFTPSMKSTKDYPDEVINFMRSHPLMYHAVYPAHRQPLWR 425 

Qy 411 SGVEY— TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAW SGDSSAHLVEEIQLFP 464 

: I I I : I I : II : I : : I I I I : : I : I : : : : I I I : : I 

Db 426 TNVNYRFTTIAVDQVDAADGR-YEVLFLGTDRGTVQKVIVLPRDDMETEELMLEEIEVFK 484 

Qy 465 DPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTC 523 

||::::: : : : I : I I : I I I I : I I I I I I I I : I I I I : I 

Db 485 VPAPIKMMTISSKRQQLYVSSAVGVTHLALHRCDVYGEACADCCLARDPYCAWD--GKAC 542 

Qy 524 CLLSAPN-LNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEV-LAVPNSILELPCP 581 

I I : I : I I : I I I I I : : : I I I I I 

Db 543 S RYS AS S KRRS RRQDVRHGNPMRQC RGYNSNANKNTVEAVQYGVEGSTAFLECQ 596 

Qy 582 HLSALASYYW SHGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWAT 627 

I I : I : I I : I I I : I I I I I I I 

Db 597 PRSPQATVKWLLQKDNSDRRKELRTEGGRAL RTEQGLLLRALQLSDSGLYSCTAT 651 

Qy 62 8 ENGFS YPVI S YWVD S QDQTLA LDPEL AG I P R~ EH VKVP LT RVS GG 671 

I I I : I I : I I III I II : : III I 

Db 652 ENNFKHTVTKV QLRVLAARAVHAVLLQGELPPAALPGAPTPRYQDLLQLLTRPELG 707 

Qy 672 AALAAQQSYW P H FVT VT VL FALVL S GAL 1 1 LVAS P L RAL RARGKVQ GC ET L R P G 725 

I : I I I I I I : I : : : I 
Db 708 LLDQYCQGFWRPPAPGP PEPLAALKAK-ELQ D 738 

Qy 726 EKAPLSREQHLQSPKEC 742 

: I I I I I hi 

Db 739 QKKPRSRRNH--PPESC 753 

RESULT 10 
Q8JIW9 

ID Q8JIW9 PRELIMINARY; PRT; 774 AA. 

AC Q8JIW9; 



DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Semaphorin 3A. 

OS Xenopus laevis (African clawed frog). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Head; 

RA Tannahill D., Nielsen J., Regan A.G.; 

RT "Xenopus Semaphorin 3A."; 

RL Submitted (APR-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY030051; AAK38166.1; -. 

DR GO; GO:0005634; C:nucleus; IEA. 

DR GO; GO:0003677; F:DNA binding; IEA. 

DR GO; GO:0007275; P:development; IEA. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR001005; Myb_DNA_binding. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS00037; MYB_1; 1. 

SQ SEQUENCE 774 AA; 89319 MW; 3127FD2E0CCD5FA4 CRC64; 

Query Match 21.0%; Score 847; DB 13; Length 774; 

Best Local Similarity 31.1%; Pred. No. 1e-63; 

Matches 227; Conservative 129; Mismatches 276; Indels 98; Gaps 27; 

Qy 17 FLFQLLQLLLPTTTAGGGGQGPMPRVRY YAGDERRALSFFHQKGLQDFDTLLLSGDG 73 

I I I I : I I : : I I : I : : : I : : | : | : 

Db 9 FLF--LGVSLTTRLNCQNVKNNVPRLRLSYKEMVESNNLITFNGWPNSSSYNTFVLDEER 66 

Qy 74 NTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVS 133 

I I I I I : : I : : : : I : : : I I : : : : I I : I : I I I I : I I : 
Db 67 GRLYVGAKDHIFSFNLAN--VKEFQKIL-WPVTNIRRDECKWAGKDIVKECANFIKVLKA 123 

Qy 134 YNVTHLYTCGTFAFSPACTFIEL QDSYLLPISEDKVME-GKGQSPFDPAHKHTAV 187 

I I I I I I I I I II I I I : I : : I i : II I I : I : I I : I I : : 

Db 124 YNHTHLYACGTGAFHPVCTYIDVGHYPEQDNVFK--LEDSFFENGRGKSPYDPKLLTASI 181 

Qy 188 LVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNF-LRWLHHDASFVAAI PS 239 

1:1111111 : I : I : : I I I I : : : I : I I I : I hi I 

Db 182 LIDGELYSGTAADFMGRDFAIFRTLGNHHPIRTEQHDSRWL-NDPRFIGAYLVPESDNPE 240 

Qy 240 TQWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG-- 297 

I I I I I I I : : : : I : : : I I I I I I : I I I I I I I I I : I : I : I I 
Db 241 DDKVYFFFRENAIDGEHTGKATHARIGQLCKNDFGGHRSLVNKWTTFLKARLICSVPGPN 300 



Qy 298 --QLPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYK 355 

I : : : I I : : I I : I i I i I : : : I II I : I I I I I I I I I 
Db 301 GIDTHFDELQDWLMNSKDPKNPWYAVFTTSSNI — FKGSAVCMYSLTDIRRVFLGPYA 358 

Qy 356 ELNKETSRWTTYRGPETNPRPGSC SVGPSSDKALTFMKDHFLMDEQW 403 

: : I : : I I I I I : I I I : : I : I I I 

Db 359 HRDGPNYQWVPFQGRVPYPRPGTCPRQTFGGFDSTKDLPDEVIMFARSHPAMYNPVFPIN 418 

Qy 404 GTPLLVKSGVEY--TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAV VSGDSSAHL 456 

|:::|: hi h: I: : II : ll::ll I — I I I I 

Db 419 NRP 1 1 1 KT EVD YQ FTQI WDRVEAEDG- QYDVMFI GTDMGT I LKWS VP KETWTDLEEVL 477 

Qy 457 VEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCA 515 

: I I : : I : I : : : : : I : : : I I I I :: I I I I : : I : I I I I I I : II 
Db 478 LEEMTVFREPTAISAMEISTKQQQLYIGSSVGVSQLPLHRCDVYGKACAECCLARDPYCA 537 

Qy 516 WDPESRTCCLLSAPNL--NSWKQDMERGNPEWACASGPMSRSLRPQSRP--QIIKE--VL 569 

II I I I : : | | : | : | | : I : | | I : : | : 

Db 538 WDGSS CSRYFPTAKRRTRRQDIRNGDPLTHCS DLQHQDDPHRQSLEEKII Y 588 

Qy 570 AVPNSILELPCPHLSALASYYW SHGPAAVPEASSTVYNGSLLLIVQDGVGGL 621 

I I I I I I I : I II : I I I : : I : 

Db 589 GVENSSTFLECSPKSQRALVFWQFQKQNEEKKDEIKVDERKIKTEHGLLLRTLKKRDSGI 648 

Qy 622 YQ C WAT EN G F S Y P VI S YWVD S Q DQT L AL D P E LAG I P REH VKVP LT RVS GGA 672 

1111:11 I I I I I I I I : I : I 

Db 649 YYCNAVEHGFM QTL-LKVTLEI IDTEHLDELLHKEDEGGDSHKHKEPS 695 

Qy 673 -ALAAQQSYW 681 

: : : I I 
Db 696 NSMSPTQKIW 705 



RESULT 11 
Q8TB71 

ID Q8TB71 PRELIMINARY; PRT; 748 AA. 

AC Q8TB71; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Muscle; 

RA Strausberg R.; 

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC024220; AAH24220.1; -. 

DR GO; GO:0007275; P:development; IEA. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 748 AA; 83034 MW; 587C53CB65AB4656 CRC64; 

Query Match 20.3%; Score 817.5; DB 4; Length 748; 

Best Local Similarity 31.4%; Pred. No. 3.3e-61; 

Matches 208; Conservative 114; Mismatches 254; Indels 87; Gaps 24; 

Qy 32 GGGGQGPMPRVRYYAGDERRALSF FHQKGLQDF DTLLLSGDGNTLYV 78 

I I I I I I I I : I I I I I I I : : I : I 

Db 2 0 GLGSAAPSP PRLRLSFQELQAVJH — GLQTFSLERTCCYQALLVDEERGRLFV 69 

Qy 79 GAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTH 138 

|| : : | : : : : : : I I I : I I : I I : I I I : : : t : i I I I 
Db 70 GAENHVASLNLDN--ISKRAKKLAWPAPVEWREECNWAGKDIGTECMNFVKLLHAYNRTH 127 

Qy 139 LYTCGTFAFSPACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYS 195 

I | | | | | | I I : I : : : I : : : : I I I : I I : I I I : : I I I III 
Db 128 LLACGTGAFHPTCAFVEVGHRAEEPVLRLDPGRIEDGKGKSPYDPRHRAASVLVGEELYS 187 



Qy 196 GTMNNFLGSEPILMRTLGSQPVLKTD-NFLRWLHHDASFVAAI PSTQWYFFF 247 

I : : I : : I : I I : I I : I : : III:: II I : I I I I 

Db 188 GVAADLMGRDFTIFRSLGQRPSLRTEPHDSRWL-NEPKFVKVFWIPESENPDDDKIYFFF 246 



Qy 248 EETASE-FDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQP GQLPFNV 303 

I I I I II I I I : : I : I I I I I : : I I I I I I I I I : I : I : I I I : 

Db 247 RETAVEAAPALGRLSVSRVGQICRNDVGGQRSLVNKWTTFLKARLVCSVPGVEGDTHFDQ 306 

Qy 304 IRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSR 363 

: : II: I : I I I I : : I I I I I : I : I : I I I : : 
Db 307 LQDVFLLS S RDHRT P LL YAVFST S S I FQG SAVCVYSMNDVRRAFLGPFAHKEGPMHQ 363 

Qy 364 WTTYRGPETNPRPGSC S VGP S S DKALT FMKDHFLMDEQW GTPLLVKS 411 

I : I : I I I I I I I I : I : : I I I I : III:: 

Db 364 WVSYQGRVPYPRPGMCPSKTFGTFSSTKDFPDDVIQFARNHPLMYNSVLPTGGRPLFLQV 423 

Qy 412 GVEY — TRLAVETAQGLDGHSHLVMYLGTTTGSLHK — AWSG DSSAHLVEEIQLFP 464 

I I I : : I : I I I : I : : : I I I : : I : I I : I : I I : : I 

Db 424 GANYTFTQIAADRVAAADGH-YDVLFIGTDVGTVLKVTSVPKGSRPSAEGLLLEELHVFE 482 

Qy 465 DPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTC 523 

| | : : | : : : : : I I : : I : : I : I I i I I I : I I I I 
Db 483 DSAAVTSMQISSKRHQLYVASRSAVAQIALHRCAAHGRVCTECCLARDPYCAWD GVA 539 

Qy 524 CLLSAPNLNS--WKQDMERGNPEWACASGPMSRSLRPQSRPQIIK-EVLAVPNSILELPC 580 

I | : : | | : | : | | : I I I : : : : I I I II 

Db 540 CTRFQPSAKRRFRRQDVRNGDPSTLCSG DS SRPALLEHKVFGVEGS SAFLEC 591 

Qy 581 PHLSALASYYW SHGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFS 632 

III : I I : III:: I : I I I I I I : 

Db 592 EPRSLQARVEWTFQRAGVTAHTQVLAEERTERTARGLLLRRLRRRDSGVYLCAAVEQGFT 651 



Qy 633 YPV 635 

Db 652 QPL 654 



RESULT 12 
Q8BMF6 



ID Q8BMF6 PRELIMINARY; PRT; 777 AA. 

AC Q8BMF6; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Semaphorin 3D precursor homolog. 

GN SEMA3D OR 4631426B19RIK. 

OS Mus musculus (Mouse). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Testis; 

RX MEDLINE=22354683; PubMed=12466851; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs."; 

RL Nature 420:563-573(2002). 

DR EMBL; AK031704; BAC27522.1; -. 

DR MGD; MGI:1860118; Sema3d. 

DR GO; GO:0007275; P:development; IEA. 

DR InterPro; IPR001092; HLH_basic. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003659; Plexin-like. 



DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS00038; HLH_1; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

SQ SEQUENCE 777 AA; 89562 MW; B7204D82288B89CE CRC64; 

Query Match 19.5%; Score 788; DB 11; Length 777; 

Best Local Similarity 31.2%; Pred. No. 1.2e-58; 

Matches 212; Conservative 111; Mismatches 279; Indels 78; Gaps 25; 

Qy 11 WSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGD ERRALS FFHQKGLQDFDTL 67 

I : I : : III! : : I I : : I : I I I I I 

Db 22 WMMLIMTV LFLPVTET SKQNIPRLKLTYKDLLLSNTCIPFLGSSEGLDFQTL 73 

Qy 68 LLSGDGNTLYVGAREAILALDIQDPGVPRLKNM--IPWPASDRKKSECAFKKKSNETQCF 125 

II: I : I I :: : I z I II Mil:: | 1 : I 

Db 74 LLDEERGILLLGAKDHVFLLSLVDLN KNFKKIYWPAAKERVELCKLAGKDANAECA 129 

Qy 126 NFIRVLVSYNVTHLYTCGTFAFSPACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAH 182 

I I I I I I I I I I : I I I I I I I I : I : I : : : : : I : : I I I I 
Db 130 NFIRVLQPYNKTHVYVCGTGAFHPLCGYIDLGANKEELIFKLDTHNLESGRLKCPFDPQQ 189 



Qy 183 KHTAVLVDGMLYS GTMNNFLGS E P I LMRTLG SQPVLKTD-NFLRWLHHDASFVAAI- 237 

: I : I I I I I I : : I I I : I : I I : : I I : I I : I I : 

Db 190 PFASVMTDEHLYSGTASDFLGKDTAFTRSLGLMQDHHSIRTDISEHHWL-NGAKFIGTFP 248 

Qy 238 PSTQWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQL 291 

I : | | 1 | | : : | : I I I I I I I I I I I 1 I :: i I I I I I I I I : I 

Db 249 IPDTYNPDDDKIYFFFRESSQEGSTSDRSILSRVGRVCKNDVGGQRSLINKWTTFLKARL 308 

Qy 292 LCTQPG Q L P FN VI RH AVL L PAD S P TAP H I YAVFT S QW Q VG GT RS S AVC AF S L L D I E 347 

: I : I I I : : : 111 I : I II I : : : I I I I : I : I I 

Db 309 ICSIPGSDGADTHFDELQDIYLLPTRDERKPVVYGVFTTTSSI--FKGSAVCVYSMADIR 366 

Qy 348 RVFKGKYKELNKETSRWTTYRGPETNPRPGSC S VGP S S DKALT FMKDH FL 397 

I I I I I I I I I I II : I I | : : | : : | : 

Db 367 AVFNGPYAHKESADHRWVQYDGRIPYPRPGTCPSKTYDPLIKSTRDFPDDVISFIRRHPV 426 

Qy 398 MDEQ WGTPLLVKSGVEY— TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSGDS 452 

I : III : | : | | : : I : I I : I I : I I I I : : I I 

Db 427 MYKSVYPVAGAPTFKRINVDYRLTQIWDHWAEDG-QYDVMFLGTDIGTVLKWSISKE 485 

Qy 453 SAHL VEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCV 507 

: : : I I : I : I I : I : : I : I : : I I I : : : I I : : I I I 
Db 486 KWNMEEWLEELQVFKHPTAI LNMELSLKQQQLYVGSWDGLVQLSLHRCDTYGKACADCC 545 

Qy 508 LARDPHCAWDPESRTCCLLSAP--NLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQII 565 

11111:1111 : I II : : | | : : | : I I : I : : : : 

Db 546 LARDPYCAWDGNA CSRYAPTSKRRARRQDVKYGDPITQC — WDIEDSISHETADE — 598 

Qy 566 KEVLAVPNS I LELPCPHLSALAS YYW SHGPAAVPEAS — STVYNGSLLLI VQD 616 

I : : : I I I I I I I I : I I I I : : I 

Db 599 KVIFGIEFNSTFLECIPKSQQASVEWYIQRSGDEHREELKPDERIIKTDY-GLLIRSLQK 657 

Qy 617 GVGGLYQCWATENGFSYPVI 636 

I : I I I I : I : : : 
Db 658 KDSGMYYCKAQEHTFIHTIV 677 



RESULT 13 
Q8BH34 

ID Q8BH34 PRELIMINARY; PRT; 777 AA. 

AC Q8BH34; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Semaphorin 3D precursor homolog. 

GN SEMA3D OR 4631426B19RIK. 

OS Mus musculus (Mouse). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Kidney, and Skin; 

RX MEDLINE=22354683; PubMed=12466851; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 



RT 60,770 full-length cDNAs."; 

RL Nature 420:563-573(2002). 

DR EMBL; AK028900; BAC26185.1; -. 

DR EMBL; AK052671; BAC35092.1; -. 

DR MGD; MGI:1860118; Sema3d. 

DR GO; GO:0007275; P:development; IEA. 

DR InterPro; IPR001092; HLH_basic. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS00038; HLH_1; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

SQ SEQUENCE 777 AA; 89548 MW; 5450D8D45D1BDABF CRC64; 

Query Match 19.5%; Score 787; DB 11; Length 777; 

Best Local Similarity 31.2%; Pred. No. 1.5e-58; 

Matches 212; Conservative 111; Mismatches 279; Indels 78; Gaps 25; 

Qy 11 WSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGD ERRALS FFHQKGLQDFDTL 67 

I : I : : I I 1 I : : I I : : I : I I I I I 

Db 22 WMMLIMTV LFLPVTET SKQNIPRLKLTYKDLLLSNTCIPFLGSSEGLDFQTL 73 

Qy 68 LLSGDGNTLYVGAREAILALDIQDPGVPRLKNM--IPWPASDRKKSECAFKKKSNETQCF 125 

i I : | : | | : : : | : I II I I I I : : I I : I 

Db 74 LLDEERGILLLGAKDHVFLLSLVDLN KNFKKIYWPAAKERVELCKLAGKDANAECA 129 

Qy 126 NFIRVLVSYNVTHLYTCGTFAFSPACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAH 182 

I I I I I I I I I I : I I I I I I I I : I : I : : : : : I : : I I I I 
Db 130 NFIRVLQPYNKTHVYVCGTGAFHPLCGYIDLGANKEELIFKLDTHNLESGRLKCPFDPQQ 189 

Qy 183 KHTAVLVDGML Y S GTMNN FLGS E P I LMRT LG SQPVLKTD-NFLRWLHHDAS FVAAI - 237 

: I : I I I I I I : : I I I : I : I I : : I I : I I : I I : 

Db 190 PFASVMTDEHLYSGTASDFLGKDTAFTRSLGLMQDHHSIRTDISEHHWL-NGAKFIGTFP 248 

Qy 238 PSTQWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQL 291 

I : I I I I I : : I : I I I I I I I I I I I I I : : I I I I I I I I I : I 
Db 249 IPDTYNPDDDKIYFFFRESSQEGSTSDRSILSRVGRVCKNDVGGQRSLINKWTTFLKARL 308 

Qy 292 LCTQPG Q L P FNVI RH AVLL P AD S PTAPH I YAVFT S QWQVGGT RS S AVC AF S L L D I E 347 

: I : I I I : : : III I : I I I I : : : I I I I : I : I I 

Db 309 ICSIPGSDGADTHFDELQDIYLLPTRDERNPVVYGVFTTTSSI--FKGSAVCVYSMADIR 366 

Qy 348 RVFKGKYKELNKETSRWTTYRGPETNPRPGSC SVGPSSDKALTFMKDHFL 397 

I I I I I I I I I I I I : I I | : : | : : I : 

Db 367 AVFNGPYAHKESADHRWVQYDGRIPYPRPGTCPSKTYDPLIKSTRDFPDDVISFIRRHPV 426 

Qy 398 MDEQ WGT P LLVKS GVE Y — T RLAVETAQ GLDGH S HLVMYLGTT T G S LH KAWS GD S 452 

I : III : | : | | : : | : I I : I I : I I I I : : I I 

Db 427 MYKSVYPVAGAPTFKRINVDYRLTQIWDHWAEDG-QYDVMFLGTDIGTVLKWSISKE 485 

Qy 453 SAHL VEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCV 507 

: : : I I : I : I I : I : : I : I : : I I I : : : I I : : I I I 
Db 486 KWNMEEWLEELQVFKHPTAILNMELSLKQQQLYVGSWDGLVQLSLHRCDTYGKACADCC 545 



Qy 508 LARDPHCAWDPESRTCCLLSAP--NLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQII 565 

I I I I I : I I I ! : I II : : I I : : | : I | : | : : : : 

Db 546 LARDPYCAWDGNA CSRYAPTSKRRARRQDVKYGDPITQC — WDIEDSISHETADE — 598 

Qy 566 KEVLAVPNSILELPCPHLSALASYYW SHGPAAVPEAS — STVYNGSLLLIVQD 616 

I : : : II I I I I I I : I I I I : : I 

Db 599 KVIFGIEFNSTFLECIPKSQQASVEWYIQRSGDEHREELKPDERIIKTDY-GLLIRSLQK 657 

Qy 617 GVGGLYQCWATENGFSYPVI 636 

I : I I I I : I : : : 

Db 658 KDSGMYYCKAQEHTFIHTIV 677 



RESULT 14 
Q9NS98 

ID Q9NS98 PRELIMINARY; PRT; 782 AA. 

AC Q9NS98; Q9H7Q3; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Semaphorin sem2 (FLJ00014 protein). 

GN SEM2 OR FLJ00014. 

OS Homo sapiens (Human). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Seki N., Hattori A., Hayashi A., Kozuma S., Muramatsu M., Miyajima N., 

RA Saito T.; 

RT "Human semaphorin."; 

RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 58-782 FROM N.A. 

RC TISSUE=Spleen; 

RA Ohara O., Nagase T., Kikuno R., Okumura K.; 

RT "The nucleotide sequence of a long cDNA clone isolated from human 

RT spleen."; 

RL Submitted (AUG-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AB029496; BAA98132.1; -. 

DR EMBL; AK024425; BAB15715.1; -. 

DR GO; GO:0007275; P:development; IEA. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

SQ SEQUENCE 782 AA; 86700 MW; 85CB424874DF6663 CRC64; 



Query Match 18.3%; Score 737; DB 4; Length 782; 

Best Local Similarity 28.9%; Pred. No. 3e-54; 



Matches 236; Conservative 128; Mismatches 320; Indels 132; Gaps 33; 



Qy 11 WSLLGLFLFQLLQLLLPTTTAGGGGQGP MPRVRYYAGD ERRALS FFHQKGLQ 62 

I I I I I I I I I : I I : I I I : I : I 

Db 10 WLLGGLLLH GGSSGPSPGPSVPRLRLSYRDLLSANRSAIFLGPQGSL 56 

Qy 63 DFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNET 122 

: : I : I : : i : I : : I : I I : : : I I : : I I I : I 
Db 57 NLQAMYLDEYRDRLFLGGLDALYSLRL-DQAWPDPREVL-WPPQPGQREECVRKGRDPLT 114 

Qy 123 QCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIEL--QDSYLLPISEDKVMEGKGQSPFDP 180 

: I I I : I I I : I I i I I 1 I I I I I I : : : : I : I I : I : I : I 
Db 115 ECANFVRVLQPHNRTHLLACGTGAFQPTCALITVGHRGEHVLHLEPGSVESGRGRCPHEP 174 

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFV--AAIP 238 

: : : I I I I : I : I M I : : I : I : I I : : I : : I I I II I I I 
Db 175 SRPFASTFIDGELYTGLTADFLGREAMIFRSGGPRPALRSDSD-QSLLHDPRFVMAARIP 233 

Qy 239 STQWYFFFEETAS EFD- FFERLHTSRVARVCKNDVGGEKLLQKKWTT FLKAQLL 292 

I I t I I II I : I I I I I I I I I I ::: I I I : I I I I I : I : 

Db 234 ENSDQDNDKVTFFFSETVPSPDGGSNHWVSRVGRVCWDAGGQRVLVNKWSTFLKARLV 293 

Qy 2 93 CTQPG QLP FNVI RHAVLLPADS PTAPH I YAVFT S QWQVGGTRS S AVCAFS LLD I ER 34 8 

|:M : I : : II : : : I I : I :: I : I I I : : I I 
Db 294 CSVPGPGGAETHFDQLEDVFLLWPKAGKSLEVYALFSTVSAV--FQGFAVCVYHMADIWE 351 

Qy 349 VFKGKYKELNKETSRWTTYRGPETNPRPGSC S VGP S S DKALT FMKDH 395 

Ml: : : i I I I I I I I I I : I I : I 

Db 352 VFNGPFAHRDGPQHQWGPYGGKVPFPRPGVCPSKMTAQPGRPFGSTKDYPDEVLQFARAH 411 

Qy 396 FLMDEQV VGTPLLVKSGV--EYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAVV-- 448 

|| | I I : I I I : : : s : I : : I I : : I : : i I I : I I : I : 
Db 412 PLMFWPVRPRHGRPVLVKTHLAQQLHQIWDRVEAEDG-TYDVIFLGTDSGSVLKVIALQ 470 

Qy 449 SGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCV 504 

I : : : I I : I : I I I : : : : : : : : I I I I : : I I : I 
Db 471 AGGSAEPEEWLEELQVFKVPTPITEMEISVKRQMLYVGSRLGVAQLRLHQCETYGTACA 530 

Qy 505 DCVLARDPHCAWDPESRTCCLLSAPNLNS WKQDMERGNPEWAC ASGPMS 553 

: I I I I I I : I I II I I 1:1 : I I : III I I I :: 

Db 531 E C C LARD P YCAWD GAS CTHYRPSLGKRRFRRQDIRHGNPALQCLGQSQEEEAVGLVA 587 

Qy 554 RSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYW SHGPAAVPEASSTVY- 606 

:: I :: I I I I: I III " 

Db 588 ATM VYGTEHNSTFLECLPKSPQAAVRWLLQRPGDEGPDQVKTDERVLHT 636 

Qy 607 -NGSLLLIVQDGVGGLYQCWATENGFSYPVIS YWVDSQ-DQTLALDPE 652 

II : I I I hill I : I I I I : I : 

Db 637 ERGLLFRRLSRFDAGTYTCTTLEHGFSQTWRLALWIVASQLDNLFPPEPKPEEPPARG 696 

Qy 653 -LAG I PREHVKVPLTRVSGGAALAAQQS Y W PH FVT VT VL FALVL S GAL 1 1 LVAS P L 707 

Ml: : : : | I | III II I 
Db 697 GLASTPPKAWYKDILQLIGFANLPRVDEYCERVWCRGTT ECSGCF RSRS 745 

Qy 708 RALRARGKVQGCETLRPGEKAPLSR--EQHLQSPKE 741 

I : II I I I hi II : I : : h I 

Db 746 RGKQARGK--SWAGLELGKKMK-SRVHAEHNRTPRE 778 



RESULT 15 
Q96GX0 

ID Q96GX0 PRELIMINARY; PRT; 635 AA. 

AC Q96GX0; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Strausberg R.; 

RL Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC009113; AAH09113.1; -. 

DR GO; GO:0007275; P:development; IEA. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 635 AA; 70694 MW; D994099B476B9210 CRC64; 

Query Match 18.3%; Score 736; DB 4; Length 635; 

Best Local Similarity 32.4%; Pred. No. 2.7e-54; 

Matches 180; Conservative 99; Mismatches 215; Indels 62; Gaps 19; 

Qy 126 NFIRVLVSYNVTHLYTCGTFAFSPACTFIEL QDSYLLPISEDKVMEGKGQSPFDPAH 182 

I I : : : I : I I I I I II! II I I hh : : I : : : : II I : I I : I I I 
Db 2 NFVKLLHAYNRTHLLACGTGAFHPTCAFVEVGHRAEEPVLRLDPGRIEDGKGKSPYDPRH 61 

Qy 183 KHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTD-NFLRWLHHDASFVAAI 237 

: : I I I I I I I : : I : : I : I I : I I : I : : III:: II 
Db 62 RAASVLVGEELYSGVAADLMGRDFTIFRSLGQRPSLRTEPHDSRWL-NEPKFVKVFWIPE 120 

Qy 238 PSTQWYFFFEETASE-FDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLC 293 

I : I I I I I I I I II I I I : : I : I II I I : : I I I I II I I I : I : I 
Db 121 SENPDDDKIYFFFRETAVEAAPALGRLSVSRVGQICRNDVGGQRSLVNKWTTFLKARLVC 180 

Qy 294 TQP GQLPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVF 350 

: I I I : : : I I : I : I I I I : : I I I I I : I : I : I I 
Db 181 SVPGVEGDTHFDQLQDVFLLSSRDHRTPLLYAVFSTSSI FQGSAVCVYSMNDVRRAF 237 

Qy 351 KGKYKELNKETSRWTTYRGPETNPRPGSC SVGPSSDKALTFMKDHFLMDEQ 401 

I : : I : I : I I I I I i I I : I : : I I I 
Db 238 




Qy 402 VV GTPLLVKSGVEY--TRLAVETAQGLDGHSHLVMYLGTTTGSLHK--AVVSG D 451 

I : | | | : : I | I : : I : I I I : I : :: I I I :: I : I | 

Db 298 VLPTGGRPLFLQVGANYTFTQIAADRVAAADGH-YDVLFIGTDVGTVLKVISVPKGSRPS 356 

Qy 452 SSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLAR 510 

: | : | | : : I I I : : I : : : : : I I : : I : : I : I I I I 

Db 357 AEGLLLEELHVFEDSAAVTSMQISSKRHQLYVASRSAVAQIALHRCAAHGRVCTECCLAR 416 

Qy 511 DPHCAWDPESRTCCLLSAPNLNS--WKQDMERGNPEWACASGPMSRSLRPQSRPQIIK-E 567 

11:1111 I I : : I I : I : I I : I I I : : : : 

Db 417 DPYCAWD GVACTRFQPSAKRRFRRQDVRNGDPSTLCSG DSSRPALLEHK 465 

Qy 568 VLAVPNSILELPCPHLSALASYYW SHGPAAVPEASSTVYNGSLLLIVQDGVG 619 

I I I I I 1 I I : I I : III:: 

Db 466 VFGVEGSSAFLECEPRSLQARVEWTFQRAGVTAHTQVLAEERTERTARGLLLRRLRRRDS 525 

Qy 620 GLYQCWATENGFSYPV 635 

I : I I I I I I : I : 
Db 526 GVYLCAAVEQGFTQPL 541 



Search completed: May 5, 2004, 17:30:49 
Job time : 55 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: May 5, 2004, 17:25:21 ; Search time 17 Seconds 
(without alignments) 
2330.905 Million cell updates/sec 

Title: US-10-015-391A-277 
Perfect score: 4031 
Sequence: 1 MALPALGLDPWSLLGLFLFQ..........CRTSASDVDADNNCLGTEVA 761 

Scoring table: BLOSUM62 
Gapop 10.0 , Gapext 0.5 



Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 141681 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database: SwissProt 42:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result         Query 
   No.  Score  Match  Length  DB  ID          Description 

     1   4031  100.0     761   1  SM4A_HUMAN  Q9h3s1 homo sapien 
     2 3280.5   81.4 
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3 
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31.5 


832 


1 
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4 
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29.7 
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1 
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mus musculu 


5 
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26.8 


834 


1 


SM4C_MOUSE 


Q64151 
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6 
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26.3 
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1 
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7 
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25.9 


862 
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8 
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24.4 
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19 
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HUMAN 


Q9p283 


homo sapien 


31 
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32 
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33 
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34 
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39 
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16. 


7 
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42 
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16. 
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43 
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16. 


2 


771 


1 
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44 
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45 
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1 
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Q17330 
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alignments 



RESULT 1 
SM4A_HUMAN 

ID SM4A_HUMAN STANDARD; PRT; 761 AA. 

AC Q9H3S1; Q8WUA9; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Semaphorin 4A precursor (Semaphorin B) (Sema B). 

GN SEMA4A OR SEMB. 

OS Homo sapiens (Human). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Seki N., Hattori A., Hayashi A., Kozuma S., Muramatsu M., Miyajima N., 

RA Saito T.; 

RT "Human semaphorin B."; 

RL Submitted (JUN-1999) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Colon; 

RX MEDLINE=22388257; PubMed=12477932; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L., 

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C., 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: INHIBITS AXONAL EXTENSION BY PROVIDING LOCAL SIGNALS TO 
CC SPECIFY TERRITORIES INACCESSIBLE FOR GROWING AXONS (By 

CC similarity). 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AB029394; BAB20087.1; -. 

DR EMBL; BC020974; AAH20974.1; -. 

DR Genew; HGNC:10729; SEMA4A. 

DR MIM; 607292; -. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR002165; Plexin_repeat. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

KW Signal; Transmembrane; Immunoglobulin domain; Multigene family; 

KW Neurogenesis; Developmental protein; Glycoprotein. 

FT SIGNAL 1 32 POTENTIAL. 

FT CHAIN 33 761 SEMAPHORIN 4A. 

FT DOMAIN 33 683 EXTRACELLULAR (POTENTIAL). 

FT TRANSMEM 684 704 POTENTIAL. 

FT DOMAIN 705 761 CYTOPLASMIC (POTENTIAL). 

FT DOMAIN 64 478 SEMA. 

FT DOMAIN 496 548 PSI. 

FT DOMAIN 573 631 IG-LIKE C2-TYPE. 

FT DISULFID 580 624 BY SIMILARITY. 

FT CARBOHYD 120 120 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 135 135 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 496 496 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 607 607 N-LINKED (GLCNAC...) (POTENTIAL).

FT CONFLICT 293 328 CTQPGQLPFNVIRHAVLLPADSPTAPHIYAVFTSQW ->

FT SAPSRGSCPSTSSATRSCSPPILPQLPTSTQSSPPSG (IN

FT REF. 1).

FT CONFLICT 354 354 Y -> F (IN REF. 1).

SQ SEQUENCE 761 AA; 83573 MW; 5BCB889AA32A2BB3 CRC64;



Query Match 100.0%; Score 4031; DB 1; Length 761; 

Best Local Similarity 100.0%; Pred. No. 9.8e-318; 

Matches 761; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKG 60

Qy 61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

Qy 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPST 240

Qy 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 QVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLP 300

Qy 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 FNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKE 360

Qy 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 TSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRLAV 420

Qy 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 ETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGA 480

Qy 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 VFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMER 540

Qy 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 GNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPE 600

Qy 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREH 660

Qy 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 VKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCE 720

Qy 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
       |||||||||||||||||||||||||||||||||||||||||
Db 721 TLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761



RESULT 2 
SM4A_MOUSE

ID SM4A_MOUSE STANDARD; PRT; 760 AA.

AC Q62178; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 4A precursor (Semaphorin B) (Sema B).

GN SEMA4A OR SEMAB OR SEMB. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=NMRI; TISSUE=Brain;

RX MEDLINE=95267431; PubMed=7748561;

RA Pueschel A.W., Adams R.H., Betz H.;

RT "Murine semaphorin D/collapsin is a member of a diverse gene family

RT and creates domains inhibitory for axonal extension.";

RL Neuron 14:941-948(1995). 

CC -!- FUNCTION: INHIBITS AXONAL EXTENSION BY PROVIDING LOCAL SIGNALS TO 
CC SPECIFY TERRITORIES INACCESSIBLE FOR GROWING AXONS. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- DEVELOPMENTAL STAGE: EXPRESSED FROM DAY 10 IN THE EMBRYO. LOW 

CC LEVELS FOUND BETWEEN DAYS 10-12. EXPRESSION PEAKS ON DAY 13 WITH 

CC MODERATE LEVELS FROM THEN UNTIL BIRTH. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X85991; CAA59983.1; -.

DR PIR; I48745; I48745.

DR MGD; MGI:107560; Sema4a.

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 



DR SMART; SM00630; Sema; 1.

KW Signal; Transmembrane; Immunoglobulin domain; Multigene family;

KW Neurogenesis; Developmental protein; Glycoprotein.

FT SIGNAL 1 32 POTENTIAL.

FT CHAIN 33 760 SEMAPHORIN 4A.

FT DOMAIN 33 682 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 683 703 POTENTIAL.

FT DOMAIN 704 760 CYTOPLASMIC (POTENTIAL).

FT DOMAIN 64 478 SEMA.

FT DOMAIN 496 547 PSI.

FT DOMAIN 572 630 IG-LIKE C2-TYPE.

FT DOMAIN 609 612 POLY-LEU.

FT DISULFID 579 623 BY SIMILARITY.

FT CARBOHYD 120 120 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 135 135 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 496 496 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 606 606 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 760 AA; 83458 MW; 0A4D6B80767B910F CRC64;


Query Match 81.4%; Score 3280.5; DB 1; Length 760;



Best Local Similarity 82.3%; Pred. No. 4.6e-257; 

Matches 628; Conservative 46; Mismatches 84; Indels 5; Gaps 



Qy 1 MALPALGLDPWSLLGLFLFQLLQLLLPT--TTAGGGGQGPMPRVRYYAGDERRALSFFHQ 58
I I I I : I I I I II I : I I I I III: : I I I I I I II I I : I : I I I I I I I I I I
Db 1 MALPSLGQDSWSLLRVFFFQL--FLLPSLPPASGTGGQGPMPRVKYHAGDGHRALSFFQQ 58

Qy 59 KGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKK 118
llhllllllll I I I I I I I I I I I : I I I : I I : I I : I I I I I I I I I I I I : I I I : I I I I I I I
Db 59 KGLRDFDTLLLSDDGNTLYVGARETVLALNIQNPGIPRLKNMIPWPASERKKTECAFKKK 118

Qy 119 SNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPF 178
I I I I I I I I I I I I I II II MM I I M I I I I I I M I II I II I II I II I I : II I I II
Db 119 SNETQCFNFIRVLVSYNATHLYACGTFAFSPACTFIELQDSLLLPILIDKVMDGKGQSPL 178

Qy 179 DPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIP 238
11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I
Db 179 TLFTSTQAVLVDGMLYSGTMNNFLGSEPILMRTLGSHPVLKTDIFLRWLHADASFVAAIP 238

Qy 239 STQVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ 298
I I I I I I I I II I I I I I I I I I M I I I I : II I I I M I II I I I I I II I M II M I i I I I I
Db 239 STQVVYFFFEETASEFDFFEELYISRVAQVCKNDVGGEKLLQKKWTTFLKAQLLCAQPGQ 298

Qy 299 LPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELN 358
I I I I : I I II M I I I I II : I I I I I I I I I I I I I M I M I M II I I M I I II I II I I I I
Db 299 LPFNIIRHAVLLPADSPSVSRIYAVFTSQWQVGGTRSSAVCAFSLTDIERVFKGKYKELN 358

Qy 359 KETSRWTTYRGPETNPRPGSCSVGPSSDKALTFMKDHFLMDEQVVGTPLLVKSGVEYTRL 418
I II II I I I I II I : M II I I I : I I I I I I I I I II II I I I I M I I I I M I I II II II I I I
Db 359 KETSRWTTYRGSEVSPRPGSCSMGPSSDKALTFMKDHFLMDEHVVGTPLLVKSGVEYTRL 418

Qy 419 AVETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQ 478
I M : I : I M I I I : I I I I I I : I I I I I I I I II II : II II II I II I I I I I I I I II I
Db 419 AVESARGLDGSSHVVMYLGTSTGPLHKAVVPQDSSAYLVEEIQLSPDSEPVRNLQLAPAQ 478

Qy 479 GAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDM 538
MM I M I I : II II I M II II II II I I I II I II I I I II I I I I I Ml : I I I I I
Db 479 GAVFAGFSGGIWRVPRANCSVYESCVDCVLARDPHCAWDPESRLCSLLSG-STKPWKQDM 537

Qy 539 ERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAV 598
I I 1 I I I I I llhl! 1 II Ihlllll I I I I I I I I I I II I I I I I I : I I I I I :
Db 538 ERGNPEWVCTRGPMARSPRRQSPPQLIKEVLTVPNSILELRCPHLSALASYHWSHGRAKI 597

Qy 599 PEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPR 658
I II : I I I I I I I II : I I I I I I I I I I I I I I I : I I I I : I I I I I I I I I I I I ! I I I I I : I I
Db 598 SEASATVYNGSLLLLPQDGVGGLYQCVATENGYSYPVVSYWVDSQDQPLALDPELAGVPR 657

Qy 659 EHVKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVASPLRALRARGKVQG 718
I I : I I I I I I I I I :: I I I : I I I I I I : I I II hi! I I : I : I I I I I I I I I I I I II
Db 658 ERVQVPLTRVGGGASMAAQRSYWPHFLIVTVLLAIVLLGVLTLLLASPLGALRARGKVQG 717

Qy 719 CETLRPGEKAPLSREQHLQSPKECRTSASDVDADNNCLGTEVA 761
I I I I I I I I i I : I I I I I : I I I I I I I I I I I I II III
Db 718 CGMLPPREKAPLSRDQHLQPSKDHRTSASDVDADNNHLGAEVA 760



RESULT 3 
SM4B_HUMAN

ID SM4B_HUMAN STANDARD; PRT; 832 AA. 

AC Q9NPR2; Q8WVP9; Q96FK5; Q9C0B8; Q9H691; Q9NPM8; Q9NPN0; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 4B precursor. 

GN SEMA4B OR KIAA1745. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=21082932; PubMed=11214970; 

RA Nagase T., Kikuno R., Hattori A., Kondo Y., Okumura K., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. XIX. 

RT The complete sequences of 100 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 7:347-355(2000). 

RN [2] 

RP SEQUENCE OF 160-832 FROM N.A., AND VARIANT ALA-792.

RA Carim L., Estivill X., Escarceller M., Sumoy L.;

RL Submitted (JUL-2000) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE OF 358-832 FROM N.A. 

RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S.,

RA Okitani R., Ota T., Suzuki Y., Obayashi M., Nishi T., Shibahara T.,

RA Tanaka T., Nakamura Y., Isogai T., Sugano S.;

RT "NEDO human cDNA sequencing project.";

RL Submitted (AUG-2000) to the EMBL/GenBank/DDBJ databases.

RN [4] 

RP SEQUENCE OF 303-832 FROM N.A., AND VARIANT ALA-792.

RC TISSUE=Colon, and Pancreas;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length

RT human and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

CC -!- FUNCTION: INHIBITS AXONAL EXTENSION BY PROVIDING LOCAL SIGNALS TO 
CC SPECIFY TERRITORIES INACCESSIBLE FOR GROWING AXONS (BY 

CC SIMILARITY).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- CAUTION: Ref.3 sequence differs from that shown in position 709 

CC onward due to a frameshift. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AB051532; BAB21836.1; -. 

DR EMBL; AL390080; CAB98204.1; -. 

DR EMBL; AL390081; CAB98205.1; -. 

DR EMBL; AL390082; CAB98206.1; -. 

DR EMBL; AK026133; BAB15372.1; ALT_FRAME.

DR EMBL; BC010701; AAH10701.1; ALT_INIT.

DR EMBL; BC017658; AAH17658.1; -. 

DR Genew; HGNC:10730; SEMA4B.

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

KW Transmembrane; Immunoglobulin domain; Multigene family; Neurogenesis; 

KW Developmental protein; Glycoprotein; Signal; Polymorphism. 

FT SIGNAL 1 38 POTENTIAL. 

FT CHAIN 39 832 SEMAPHORIN 4B. 

FT DOMAIN 39 712 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 713 733 POTENTIAL.

FT DOMAIN 734 832 CYTOPLASMIC (POTENTIAL).

FT DOMAIN 65 502 SEMA.

FT DOMAIN 520 574 PSI.

FT DOMAIN 599 658 IG-LIKE C2-TYPE.

FT DOMAIN 753 776 PRO-RICH.

FT DISULFID 606 651 BY SIMILARITY.

FT CARBOHYD 64 64 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 91 91 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 160 160 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 405 405 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 520 520 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 625 625 N-LINKED (GLCNAC...) (POTENTIAL).

FT VARIANT 792 792 S -> A.

FT /FTId=VAR_010758.

FT CONFLICT 160 160 N -> P (IN REF. 2).

FT CONFLICT 565 565 E -> K (IN REF. 4; AAH10701).

SQ SEQUENCE 832 AA; 92192 MW; 29D58C1DDD5E1C6B CRC64;



Query Match 31.5%; Score 1271.5; DB 1; Length 832; 

Best Local Similarity 39.1%; Pred. No. 1.4e-94; 

Matches 304; Conservative 126; Mismatches 273; Indels 75; Gaps 27; 

Qy 10 PWSLL GLFLFQLLQLLL PTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQ 62 

III I I I I I I I III II: III I : : 

Db 10 PWGALPPRPPLLLLLLLLLLLQPPPPTWA LS PRI SLPLGS EERP FLRFEAEHI S 63 

Qy 63 DFDTLLLSGDGNTLYVGAREAILALDIQDPGVP — RLKNMI PWPASDRKKS ECAFKKKSN 120 

: : I I I I I I I I I I I I I II : I I : I : : : | | I I : I : I I I 

Db 64 NYTALLLSRDGRTLYVGAREALFALSSNLSFLPGGEYQELL-WGADAEKKQQCSFKGKDP 122 

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVM — EGKGQSPF 178 

: I I : I : : I : : : I I : I I I I I I I I I I : I : : : I : I : : I I I : I I 
Db 123 QRDCQNYIKILLPLSGSHLFTCGTAAFSPMCTYINMENFTLARDEKGNVLLEDGKGRCPF 182 

Qy 179 DPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAA— 236 

II I I I : : I I I ||:||:::| |::| : |: :| ||:: I II I : I I I : 
Db 183 DPNFKSTALWDGELYTGTVSSFQGNDPAISRSQSLRPT-KTESSLNWL-QDPAFVASAY 240 

Qy 237 IPST QWYFFFEETAS EFDFFERLHT S RVARVCKNDVGGEKLLQKKWTT FLK 288 

II: : I II I I I I I : I I I I I : I I : I I i I I I : : I I : : I I : I I I 

Db 241 IPESLGSLQGDDDKIYFFFSETGQEFEFFENTIVSRIARICKGDEGGERVLQQRWTSFLK 300 

Qy 289 AQLLCTQPGQ-LPFNVI RHAVLLPADSPTAPH 1 YAVFTSQWQVGGTRS SAVCAFS 342 

I I I II : : I MM:: I II: I I I I I I I I I Mill: 

Db 301 AQLLCSRPDDGFPFNVLQDVFTL S P S P QDWRDTLF YGVFT S QWH RGTT EG S AVCVFT 357 

Qy 343 LLDIERVFKGKYKELNKETSRWTTYRGPETNPRPGSCSVGPS SDKALTF 391 

: I : : M I I I I I : I : I I : I I I MUM : MM 
Db 358 MKDVQRVFSGLYKEVNRETQQWYTVTHPVPTPRPGACITNSARERKINSSLQLPDRVLNF 417 

Qy 392 MKDHFLMDEQWGTPLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSGD 451 

: I II II I I II I I : : I MM II I :: I :: M I I I I I II I 

Db 418 LKDHFLMDGQVRSRMLLLQPQARYQRVAVHRVPGLH-HTYDVLFLGTGDGRLHKAVSVG- 475 



Qy 452 SSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARD 511
I : : I I : I : I : M : M I : I : : II Ml lllhl M 11*1111
Db 476 PRVHIIEELQIFSSGQPVQNLLLDTHRGLLYAASHSGWQVPMANCSLYRSCGDCLLARD 535



Qy 512 PHCAWDPESRTCCLLSAPNL — NSWKQDMERGNPEWAC-AS GPMS RSLRPQSRPQI I KEV 568 

I : I I I I III | | | : | : : | | | : | | | : : : I 

Db 536 PYCAWSGSSCKHVSLYQPQLATRPWIQDIEGASAKDLCSASSWSPSFVPTGE-KPCEQV 594 

Qy 569 LAVPNSILELPCPHLSALASYYWSHGPAAVPEASS — TVYNGSLLLIVQDGVGGLYQCWA 626 

II:: I II II II: I II ::| : I III: :| :|||: 
Db 595 QFQPNTVNTLACPLLSNLATRLWLRNGAPVNASASCHVLPTGDLLLVGTQQLGE-FQCWS 653 

Qy 627 TENGFSYPVISYWVDSQDQTLALDPELAG-IPREHVKVPLTRVS GGAALAAQQSYW 681 

I I I | | | : : : I : I : I I : : I I I I I : I : I I I 
Db 654 LEEGFQQLVASYCPEWEDGVADQTDEGGSVP VI I STSRVSAPAGGKASWGADRS YW 710 

Qy 682 PHFVTVTVLFAL-VLSGALIILV — AS PLRALRARGKV QGCETLRPGEKAPLS 731 

I :: I I I I I I : I : : : : I : : I : I I I I : 

Db 711 KE FLVMCT L FVLAVLL P VL FLL YRH RN SMKVFLKQGE CAS VH P KT C PWL P P ET RP LN 768 



RESULT 4 
SM4B_MOUSE 

ID SM4B_MOUSE STANDARD; PRT; 782 AA. 

AC Q62179; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 4B (Semaphorin C) (Sema C) (Fragment).

GN SEMA4B OR SEMAC OR SEMC. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=NMRI; TISSUE=Brain;

RX MEDLINE=95267431; PubMed=7748561;

RA Pueschel A.W., Adams R.H., Betz H.; 

RT "Murine semaphorin D/collapsin is a member of a diverse gene family 

RT and creates domains inhibitory for axonal extension."; 

RL Neuron 14:941-948(1995).

RN [2] 

RP INTERACTION WITH GIPC. 

RX MEDLINE=99253973; PubMed=10318831;

RA Wang L.-H., Kalb R.G., Strittmatter S.M.; 

RT "A PDZ protein regulates the distribution of the transmembrane 

RT semaphorin, M-SemF."; 

RL J. Biol. Chem. 274:14137-14146(1999). 

CC -!- FUNCTION: INHIBITS AXONAL EXTENSION BY PROVIDING LOCAL SIGNALS TO 

CC SPECIFY TERRITORIES INACCESSIBLE FOR GROWING AXONS. 

CC -!- SUBUNIT: Interacts with GIPC PDZ domain. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- DEVELOPMENTAL STAGE: EXPRESSED FROM DAY 10 IN THE EMBRYO. LOW 

CC LEVELS FOUND BETWEEN DAYS 10-12. EXPRESSION PEAKS ON DAY 13 WITH 

CC MODERATE LEVELS FROM THEN UNTIL BIRTH. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 



CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; X85992; CAA59984.1; -.

DR PIR; I48746; I48746.

DR MGD; MGI:107559; Sema4b.

DR InterPro; IPR003659; Plexin-like.

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema.

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

KW Transmembrane; Immunoglobulin domain; Multigene family; Neurogenesis; 

KW Developmental protein; Glycoprotein. 



FT NON_TER 1 1

FT DOMAIN <1 662 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 663 683 POTENTIAL.

FT DOMAIN 684 782 CYTOPLASMIC (POTENTIAL).

FT DOMAIN 16 453 SEMA.

FT DOMAIN 471 525 PSI.

FT DOMAIN 548 608 IG-LIKE C2-TYPE.

FT DOMAIN 703 726 PRO-RICH.

FT DISULFID 555 601 BY SIMILARITY.

FT CARBOHYD 12 12 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 15 15 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 42 42 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 88 88 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 356 356 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 471 471 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 526 526 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 574 574 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 639 639 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 782 AA; 86823 MW; 627A81FC8F8F7AC8 CRC64;


Query Match 29.7%; Score 1198.5; DB 1; Length 782;

Best Local Similarity 39.9%; Pred. No. 1e-88;





Matches 275; Conservative 115; Mismatches 244; Indels 55; Gaps 22; 

Qy 49 ERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVP — RLKNMIPWPAS 106 

II: I : : : : I I I I I I I I I I I I I I I : II : : I : : : | | 

Db 1 EERLIRKFEAENISNYTALLLSQDGKTLYVGAREALFALNSNLSFLPGGEYQELL-WSAD 59 

Qy 107 DRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISE 166 

: I : I : I I I : I I : I : : I : I : I I I I I I I I I I I : I : I 

Db 60 ADRKQQCSFKGKDPKRDCQNYIKILLPLNSSHLLTCGTAAFSPLCAYIHIAS FTLAQDEA 119 

Qy 167 DKVM — EGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFL 224 

I : : I I I I I I I 1 I i : : I I I I I : I I : :: I I : : I : I : I : I I I : : I 

Db 120 GNVILEDGKGHCPFDPNFKSTALWDGELYTGTVSSFQGNDPAISRSQSSRPT-KTESSL 178 



Qy 225 RWLHHDAS FVAAI P S TQ WYFFFEETAS E FDFFERLHT S RVARVCKNDVG 274 

II I : I I I : I : : I I I I I I I I : I I I I I I I I I I I I I 

Db 179 NWL-QDPAFVASATSPESLGSPIGDDDKIYFFFSETGQEFEFFENTIVSRVARVCKGDEG 237 

Qy 275 GEKLLQKKWTTFLKAQLLCTQPGQ-LPFNVIRHAVLL PAD S P TAP H I YAVFT S QWQ V 330 

I I : : I i : : I I : I I I I I I I I : : I MM:: | II | I I I I I I 

Db 238 GERVLQQRWTSFLKAQLLCSRPDDGFPFNVLQDVFTLNPNPQDWRKTLSI-GVFTSQWHR 296

Qy 331 GGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNPRPGSCSVGPS 384 

I I | | : I I : : I : : : I I I I : : I : I I : I I I I I I : I : 

Db 297 GTTEGSAICVFTMNDVQKAFDGLYKKVNRETQQWYTETHQVPTPRPGACITNSARERKIN 356

Qy 385 S DKALT FMKDH FLMDEQWGT P LLVKS GVEYTRLAVETAQGLDGHS - HLVMYLGT 438 

I : I I : II II I II I I II:: I I : I I II I I : I : : I I I 
Db 357 SSLQLPDRVLNFLKDHFLMDGQVRSRLLLLQPRARYQRVAVHRVPGL — HSTYDVLFLGT 414 

Qy 439 TTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCS 498

I I I I I I I I : : I I : I : I I : I I : I I I I : : II 'II I I I I 

Db 415 GDGRLHKAVTL-SSRVHIIEELQIFPQGQPVQNLLLDSHGGLLYASSHSGWQVPVANCS 473 

Qy 499 VYE S CVDCVLARD PH CAWD PES RTCCLL S APNLN S — WKQDMERGN P EWACAS GPMS RS L 556 

: I : I I I : I I I I I : I I I : I I : I I I I I : I : : I : 

Db 474 LYPTCGDCLLARDPYCAWTGSACRLASLYQPDLASRPWTQDIEGASVKELCKNSSYKARF 533 

Qy 557 RPQSRPQI I KEVLAVPNSI LELPCPHLSALAS YYWSHGPAAVPEAS S — TVYNGSLLLI- 613 

: I I : I I I : : I I I I I I I : I I I I : : I : I I I I : 
Db 534 L VP GK P — C KQ VQ I Q PN T VNT LAC P LL S N LAT RLWVHN GAP VN AS AS C RVL P T GD L L L VG 591 

Qy 614 VQDGVGGLYQCWATENGFSYPVISYWVDSQDQ-TLALDPELAGIPREHVKVPLTRVS 669 

I I : I : : I I I : I I I | | | : : : : : I I I : : I I I 
Db 592 SQQGL- GVFQCWS I EEGFQQLVAS YCPEVMEEGVMDQKNQRDGTP VIINTSRVSAPA 647 

Qy 670 -GGAALAAQQSYWPHFVTVTVL — FALVL 695 

I : I : I I I I : : I I I : I I 

Db 648 GGRDSWGADKSYWNEFLVMCTLFVFAMVL 676



RESULT 5 
SM4C_MOUSE 

ID SM4C_MOUSE STANDARD; PRT; 834 AA. 

AC Q64151; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 4C precursor (Semaphorin I) (Sema I) (Semaphorin C-like 1) 

DE (M-Sema F).

GN SEMA4C OR SEMAI OR SEMACL1. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Neonatal brain; 

RX MEDLINE=95385809; PubMed=7656991;

RA Inagaki S., Furuyama T., Iwahashi Y.;

RT "Identification of a member of mouse semaphorin family."; 



RL FEBS Lett. 370:269-272(1995). 

RN [2] 

RP INTERACTION WITH GIPC. 

RX MEDLINE=99253973; PubMed=10318831;

RA Wang L.-H., Kalb R.G., Strittmatter S.M.; 

RT "A PDZ protein regulates the distribution of the transmembrane 

RT semaphorin, M-SemF."; 

RL J. Biol. Chem. 274:14137-14146(1999). 

CC -!- SUBUNIT: Interacts with GIPC PDZ domain. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: EXPRESSED WIDELY IN THE NERVOUS TISSUES DURING 
CC DEVELOPMENT. PREDOMINANTLY IN THE DEVELOPING BRAIN AND SPINAL 

CC CORD.

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; S79463; AAB35184.1; -. 

DR PIR; S66498; S66498. 

DR MGD; MGI: 109252; Sema4c. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 



DR SMART; SM00409; IG; 1.

DR SMART; SM00423; PSI; 1.

DR SMART; SM00630; Sema; 1.

DR PROSITE; PS50835; IG_LIKE; 1.

KW Signal; Transmembrane; Immunoglobulin domain; Multigene family;

KW Neurogenesis; Developmental protein; Glycoprotein.

FT SIGNAL 1 20 POTENTIAL.

FT CHAIN 21 834 SEMAPHORIN 4C.

FT DOMAIN 21 664 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 665 685 POTENTIAL.

FT DOMAIN 686 834 CYTOPLASMIC (POTENTIAL).

FT DOMAIN 53 481 SEMA.

FT DOMAIN 499 552 PSI.

FT DOMAIN 557 645 IG-LIKE C2-TYPE.

FT DOMAIN 678 687 POLY-LEU.

FT DOMAIN 709 774 PRO-RICH.

FT DOMAIN 831 834 BINDS TO GIPC.

FT DISULFID 578 628 BY SIMILARITY.

FT CARBOHYD 106 106 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 121 121 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 310 310 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 419 419 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 522 522 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 565 565 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 834 AA; 92556 MW; 6868BB5BF571482D CRC64;



Query Match 26.8%; Score 1079.5; DB 1; Length 834; 

Best Local Similarity 35.4%; Pred. No. 4.7e-79; 

Matches 271; Conservative 115; Mismatches 242; Indels 137; Gaps 25; 

Qy 39 MPRVRYYAGDERRALSFFHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLK 98 

: I I : I : : I I I : I I I I i I : I I I I I I I I : I : : I : 

Db 28 VP RKT VS S GELVT WRRF S QT GI QD FLT LT LT EH S GLL YVGAREAL FAFS VE ALELQ 84 

Qy 99 NMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGT FAFS PACT FIELQD 158 

III II II I I I I : I : I i I I I I I II -III I I I : I I I I I : I : 
Db 85 GAISWEAPAEKKIECTQKGKSNQTECFNFIRFLQPYNSSHLYVCGTYAFQPKCTYINM— 142 

Qy 159 SYLLPISED KVME GKGQ S P FD PAH KHT AVL VD GML Y S GTMNN FL G S E P I LMRT L G S Q P VL 218 

: : : I I I : I : I I I I I : I I I I I I I I : I I I I I : I I : : : I : I : : 
Db 143 -LTFTLDRAEFEDGKGKCPYDPAKGHTGLLVDGELYSATLNNFLGTEPVILRYMGTHHSI 201 

Qy 219 KTDNFLRWLHHDASFV— AAIPST QWY F F FE ETAS E FD F FERL HT S RVARV 268 

Ih II:: II I : I : : I I I I I I I : I : : I I I I I 

Db 202 KTEYLAFWL-NEPHFVGSAFVPESVGSFTGDDDKIYFFFSERAVEYDCYSEQWARVARV 260 

Qy 269 CKNDVGGEKLLQKKWTTFLKAQLLCTQPG-QLPFNVI RHAVLLPADSPTAPHI YAVFTSQ 327 

I I I : I I : I I I I I i I I I I I : I : I : I : : I I : : I I : I I : : 

Db 261 CKGDMGGARTLQKKWTTFLKARLVCSAPDWKVYFNQLKAVHTLRGASWHNTTFFGVFQAR 320 

Qy 328 WQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNPRPGSC SV 381 

I I I I I I : I I :: I I : I I I I ::: :| I I :|MIII 
Db 321 W — GDMDLSAVCEYQLEQIQQVFEGPYKEYSEQAQKWARYTDPVPSPRPGSCINNWHRDN 378 

Qy 382 GPSS DKALTFMKDHFLMDEQV VGT P L LVK S GVE YT RLAVET AQ GL D GH S H L V 433 

I : I I I I : I I I I : : I I : I I I I I I : I : : I I I I : : I 

Db 379 GYTSSLELPDNTLNFIKKHPLMEDQVKPRLGRPLLVKKNTNFTHWADRVPGLDGATYTV 438 

Qy 434 MYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVP 493 

: : : I I I I I I I I I : I I I : I : I I I I I : I I : : : : I I : : : 
Db 439 LFIGTGDGWLLKAVSLG-PWIHMVEELQVF-DQEPVESLVLSQSKKVLFAGSRSQLVQLS 496 

Qy 494 RANCSVYESCVDCVLARDPHCAWDPESRTCCLLSA PNLNSWKQDMERGN 542

I : I : I I I I II I I I I I : I II : : I :: I I : : I : I 

Db 497 LADCTKYRFCVDCVLARDPYCAWNVNTSRCVATTSGRSGSFLVQHVANLDTSKMCNQYG- 555

Qy 543 PEWACASGPMSRSLRPQSRPQIIKEVLAVPNSI LELPCPHLSA-LASYYWSHG 594 

11:1 ::| :| I III IN: II :|: I 

Db 556 IKKVRSIPKNITWSGTDLVLPC-HLSSNLAHAHWTFG 592 

Qy 595 PAAVP — EASSTVYNGSLLLIV QDGVGGLYQCWATENGFSYPVISYW VDSQDQ 645 

: I . : I : I : I : I I I I : I : : I I II I 

Db 593 SQDLPAEQPGSFLYDTGLQALVVTYIAAQSRHSGPYRCYSEEQGTRLAAESYLVAWAGSSV 652 

Qy 64 6 T LAL D P E LAG I P REH VKVP LT RVS G GAALAAQQ S YW P H FVT VT VL FAL VL S GA LI I 701 

II : I I : Ihlllh: 

Db 653 TL EARAPLENLG LVWLAWALGAVCLVLLL 682 

Qy 702 LVAS PLRALRARGKVQGCETLRPGEKA PLSREQHLQSP 739 



Db 683 LVLSLRRRLR EELEKGAKASERTLVYPLELPKEPASP 719 



RESULT 6 
SM4D_MOUSE

ID SM4D_MOUSE STANDARD; PRT; 861 AA.

AC O09126;

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 4D precursor (Semaphorin J) (Sema J) (Semaphorin C-like 2) 

DE (M-Sema G).

GN SEMA4D OR SEMAJ OR SEMACL2.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RX MEDLINE=97125976; PubMed=8969198;

RA Furuyama T., Inagaki S., Kosugi A., Noda S., Saitoh S.-I., Ogata M.,

RA Iwahashi Y., Miyazaki N., Hamaoka T., Tohyama M.;

RT "Identification of a novel transmembrane semaphorin expressed on

RT lymphocytes.";

RL J. Biol. Chem. 271:33376-33381(1996). 

CC -!- FUNCTION: MAY PLAY A FUNCTIONAL ROLE IN THE IMMUNE SYSTEM, AS WELL 
CC AS IN THE NERVOUS SYSTEM. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: STRONGLY EXPRESSED IN LYMPHOID TISSUES, 

CC ESPECIALLY IN THE THYMUS, AS WELL AS IN THE NERVOUS TISSUES. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U69535; AAC52964.1; 

DR MGD; MGI:109244; Sema4d.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003598; Ig_c2.

DR InterPro; IPR003659; Plexin-like.

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 



DR PROSITE; PS50835; IG_LIKE; 1.
KW Signal; Transmembrane; Immunoglobulin domain; Multigene family;
KW Neurogenesis; Developmental protein; Glycoprotein.
FT SIGNAL 1 23 POTENTIAL.
FT CHAIN 24 861 SEMAPHORIN 4D.
FT DOMAIN 24 733 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 734 754 POTENTIAL.
FT DOMAIN 755 861 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 50 482 SEMA.
FT DOMAIN 502 551 PSI.
FT DOMAIN 555 636 IG-LIKE C2-TYPE.
FT DISULFID 576 624 BY SIMILARITY.
FT CARBOHYD 49 49 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 77 77 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 139 139 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 191 191 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 379 379 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 419 419 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 613 613 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 632 632 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 861 AA; 95714 MW; 533CD6D271A6D79B CRC64;



Query Match 26.3%; Score 1062; DB 1; Length 861; 

Best Local Similarity 39.8%; Pred. No. 1.3e-77; 

Matches 259; Conservative 92; Mismatches 232; Indels 68; Gaps 21;

Qy 15 GLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQDFDTLLLSGDGN 74 

I I I I I : : I I I I : I I : : I : I I I : | : : : I I : I I : 

Db 9 GLFL — ALVWLRTAVA FAPVPRLTWEHGEV — GLVQFHKPGIFNYSALLMSEDKD 60 

Qy 75 TLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSY 134

I I I I I I I I I : I : : : : : : : I I : I I I : I I I I I : I : I I : I | | | 
Db 61 TLYVGAREAVFAVNALN — ISEKQHEVYWKVSEDKKSKCAEKGKSKQTECLNYIRVLQPL 118 

Qy 135 NVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLY 194

: I M Ml M I I : I I III I I I : I I I I I I : I : I : I I I I 
Db 119 SSTSLYVCGTNAFQPTCDHLNLTSFKFLGKSED GKGRCPFDPAHSYTSVMVGGELY 174 

Qy 195 S GTMNN FL G S E P I LMRT L G S Q P VL KT DN FL RWLHH DAS FVAA IPSTQWY 244 

Ml I I I I I I I I : I I I : I : : I I : : I I I | | | 

Db 175 SGTSYNFLGSEPIISRNSSHSP-LRTEYAI PWL-NEPSFVFADVIQKSPDGPEGEDDKVY 232 

Qy 245 FFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ-LPFNV 303 

I I I I : I : : I : | I t I I I I I I I I : I I I I I I : I I M : I : I : : I I I I : 
Db 233 FFFTEVSVEYEFVFKLMIPRVARVCKGDQGGLRTLQKKWTSFLKARLICSKPDSGLVFNI 292 

Qy 304 IRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVF-KGKYKE LNK 359 

: : : I I I I I I I I I : I I I I I : : I : I I I : I I I : : : 

Db 293 LQDVFVLRAPGLKEPVFYAVFTP— QLNNVGLSAVCAYTLATVEAVFSRGKYMQSATVEQ 350 

Qy 360 ETSRWTTYRGPETNPRPGSCSVGPS SDKALTFMKDHFLMDEQW GT 405 

I I I I lllhl = I I I I : I II I I I : I 

Db 351 SHTKWVRYNGPVPTPRPGACIDSEARAANYTSSLNLPDKTLQFVKDHPLMDDSVTPIDNR 410 

Qy 406 PLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPD 465 

MM I I I : : I : I I I I : I I : : | I : I I I I I : I :: I I I I I I 

Db 411 PKLIKKDVNYTQIWDRTQALDGTFYDVMFISTDRGALHKAVIL-TKEVHVIEETQLFRD 469 



Qy 466 PEPVRNLQLAPTQGAVFV--GFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTC 523

III Ih : I II I : I I : I I I : II I I II I I I I : I I I I : I 
Db 470 FEPVLTLLLSSKKGRKFVYAGSNSGWQAPLAFCEKHGSCEDCVLARDPYCAWSPAIKAC 529 

Qy 524 CLLSAPNLNS — WKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCP 581 

I = I I ill I I I I : : Ml 

Db 530 VTLHQEEASSRGWIQDM SGDTSSCL— DKSKESFNQHFFKHGGTAELKCF 577 

Qy 582 HLSALASYYW — SHG — PAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATE 628 

III I : I III: I : : I I I : I I I : I 

Db 578 QKSNLARWWKFQNGELKAASPKYGFVGRKHLLIFNLSDGDSGVYQCLSEE 628 

RESULT 7 
SM4D_HUMAN



ID SM4D_HUMAN STANDARD; PRT; 862 AA. 

AC Q92854; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 4D precursor (Leukocyte activation antigen CD100) (BB18) 

DE (A8) (GR3). 

GN SEMA4D OR CD100. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=T-cell; 

RX MEDLINE=97030273; PubMed=8876214;

RA Hall K.T., Boumsell L., Schultze J.L., Boussiotis V.A., Dorfman D.M.,

RA Cardoso A.A., Bensussan A., Nadler L.M., Freeman G.J.;

RT "Human CD100, a novel leukocyte semaphorin that promotes B-cell 

RT aggregation and differentiation."; 

RL Proc. Natl. Acad. Sci. U.S.A. 93:11780-11785(1996). 

CC -!- FUNCTION: MAY PLAY A FUNCTIONAL ROLE IN THE IMMUNE SYSTEM, AS WELL 
CC AS IN THE NERVOUS SYSTEM. INDUCES B CELLS TO AGGREGATE AND 

CC IMPROVES THEIR VIABILITY IN VITRO. 

CC -!- SUBUNIT: Homodimer. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: STRONGLY EXPRESSED IN SKELETAL MUSCLE, 

CC PERIPHERAL BLOOD LYMPHOCYTES, SPLEEN, AND THYMUS AND ALSO 

CC EXPRESSED AT LOWER LEVELS IN TESTES, BRAIN, KIDNEY, SMALL 

CC INTESTINE, PROSTATE, HEART, PLACENTA, LUNG, AND PANCREAS BUT NOT 

CC IN COLON OR LIVER. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- DATABASE: NAME=PROW; NOTE=CD guide CD100 entry; 

CC WWW="http://www.ncbi.nlm.nih.gov/prow/cd/cd100.htm".

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U60800; AAC50810.1; 

DR Genew; HGNC:10732; SEMA4D.

DR MIM; 601866; -.

DR GO; GO:0006916; P:anti-apoptosis; TAS.

DR GO; GO:0007155; P:cell adhesion; TAS.

DR GO; GO:0006955; P:immune response; TAS.

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Signal; Transmembrane; Immunoglobulin domain; Multigene family; 

KW Neurogenesis; Developmental protein; Glycoprotein. 



FT SIGNAL 1 21 POTENTIAL.
FT CHAIN 22 862 SEMAPHORIN 4D.
FT DOMAIN 22 734 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 735 755 POTENTIAL.
FT DOMAIN 756 862 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 50 482 SEMA.
FT DOMAIN 502 551 PSI.
FT DOMAIN 554 636 IG-LIKE C2-TYPE.
FT DISULFID 576 624 BY SIMILARITY.
FT CARBOHYD 49 49 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 77 77 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 139 139 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 191 191 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 329 329 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 379 379 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 419 419 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 613 613 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 632 632 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 862 AA; 96149 MW; 7B18EFEA98789371 CRC64;



Query Match 25.9%; Score 1043; DB 1; Length 862; 

Best Local Similarity 35.2%; Pred. No. 4.4e-76; 

Matches 271; Conservative 115; Mismatches 263; Indels 120; Gaps 27; 

Qy 38 PMPRVRYYAGDERRALSF— FHQKGLQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVP 95 

I : I I : : I I : I I : : : : I I I I I : I I I : I i I t I : I : : : : 
Db 26 PIPRITW EHREVHLVQFHEPDIYNYSALLLSEDKDTLYIGAREAVFAVNALN--IS 79 

Qy 96 RLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIE 155

: I I : I I : : I I I I I : I : I I : I I I I : I I I 1 I I I I I I I : 
Db 80 EKQHEVYWKVSEDKKAKCAEKGKSKQTECLNYIRVLQPLSATSLYVCGTNAFQPACDHLN 139



Qy 156 LQDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQ 215 



I I : I I I I I : I I I I I I : I : I : I I I I I I ! I I I I I I I I I : I 

Db 140 LTSFKFLGKNED GKGRCPFDPAHSYTSVMVDGELYSGTSYNFLGSEPIISRNSSHS 195 

Qy 216 PVLKTDNFLRWLHHDAS FVAA IPST QWYFFFEETASEFDFFERLHTSRV 265 

I I : I : : I I : : I I I I I : I II I I I : I : : | | : | : 

Db 196 P-LRTEYAI PWL-NEPSFVFADVI RKSPDSPDGEDDRVYFFFTEVSVEYEFVFRVLIPRI 253 

Qy 266 ARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQ-LPFNVIRHAVLLPADSPTAPHIYAVF 324

I I I I I I I I : I I I I I I : I I I I : I : I :: I I I I I : I : I : I | I : I 

Db 254 ARVCKGDQGGLRTLQKKWTSFLKARLICSRPDSGLVFNVLRDVFVLRSPGLKVPVFYALF 313 

Qy 325 TSQWQVGGTRSSAVCAFSLLDIERVFK-GKYKE LNKETSRWTTYRGPETNPRPGSCS 380 

I I : i I I I I : : I III III : : : : : I | | | I I I | : | 

Db 314 TP— QLNNVGLSAVCAYNLSTAEEVFSHGKYMQSTTVEQSHTKWVRYNGPVPKPRPGACI 371 

Qy 381 VGPS SDKALTFMKDHFLMDEQW GT P L L VK S GVE YT RL AVETAQ GL 426 

: I I I I : I I I I I I : I I hi I I I :: I : II 

Db 372 DSEARAANYTSSLNLPDKTLQFVKDHPLMDDSVTPIDNRPRLIKKDVNYTQIWDRTQAL 431 

Qy 427 DGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQG--AVFVG 484

II : I I : : I I : I I I I : I : : I : : I I I I I I I I I : | | : : | | : | 

Db 432 DGTVYDVMFVSTDRGALHKA- 1 SLEHAVHI I EETQLFQDFEPVQTLLLS S KKGNRFVYAG 490 

Qy 485 FSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAPNLNSWKQDMERGNPE 544

: I I : I I I : : ! I I I I I I I I : I I I I : | | I | | 

Db 491 S N S GWQAP LAFC GKHGT C ED CVL ARD P YC AW S P P TAT CVALHQT E SPSRGLIQEMS GDA 550 

Qy 545 WACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASST 604

I I : : I I I I I I : I I : I I 

Db 551 SVC PDKSKGSYRQHFFKHGGTAELKCSQKSNLARVFWKF-QNGVLKAESP 599 

Qy 605 VY NGSLLLIVQDGVGGLYQCWATE NGFSYPVISYWVDSQDQTLALDPELAGI 656 

I I: : :| hill : I I : I : : I |: : 

Db 600 K Y GLMGRKN LLIFNLSEGDS G VYQ C L S E E RVKN KT VFQ WAKH V LEVKW 64 9 

Qy 657 P REHVKVP L T RV- S G GAALAAQQ S YW P H FVT VT VL FALVL S GAL 1 1 LVAS PLRA 709 

hi |: | : | : : | : : | | | | | | 

Db 650 PKPWAPTLSWQTEGSRIATK VLVASTQGSSPPTPA 686 

Qy 710 LRARGKVQGCETLRPGEKAP LSREQHLQSPKECRTSASD 748 

: i I I I I I : I I : : I I I : | | 

Db 687 VQATS — SGAITL-PPKPAPTGTSCEPKIVINTVPQLHSEKTMYLKSSD 732 



RESULT 8 
SM4G_MOUSE 

ID SM4G_MOUSE STANDARD; PRT; 837 AA. 

AC Q9WUH7; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 4G precursor. 

GN SEMA4G. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;



RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; 

RX MEDLINE=99425180; PubMed=10495281;

RA Li H., Wu D.K., Sullivan S.L.; 

RT "Characterization and expression of sema4g, a novel member of the 

RT semaphorin gene family."; 

RL Mech. Dev. 87:169-173(1999). 

CC -!- FUNCTION: MAY PLAY A ROLE IN AXON GUIDANCE. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: BRAIN, SPINAL CORD, AND SEVERAL SENSORY ORGANS 
CC AS WELL AS SPECIFIC POPULATIONS OF PROJECTION NEURONS. 

CC -!- DEVELOPMENTAL STAGE: EXPRESSED EARLY IN DEVELOPMENT. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 



CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).



DR EMBL; AF134918; AAD30541.1;
DR MGD; MGI:1347047; Sema4g.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003599; Ig.
DR InterPro; IPR003659; Plexin-like.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR001627; Sema.
DR Pfam; PF01437; PSI; 1.
DR Pfam; PF01403; Sema; 1.
DR SMART; SM00409; IG; 1.
DR SMART; SM00423; PSI; 1.
DR SMART; SM00630; Sema; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
KW Signal; Transmembrane; Immunoglobulin domain; Multigene family;
KW Neurogenesis; Developmental protein; Glycoprotein.
FT SIGNAL 1 17 POTENTIAL.
FT CHAIN 18 837 SEMAPHORIN 4G.
FT DOMAIN 18 673 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 674 694 POTENTIAL.
FT DOMAIN 695 837 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 56 487 SEMA.
FT DOMAIN 505 556 PSI.
FT DOMAIN 565 647 IG-LIKE C2-TYPE.
FT DOMAIN 563 566 POLY-PRO.
FT DOMAIN 762 773 POLY-PRO.
FT DISULFID 582 630 BY SIMILARITY.
FT CARBOHYD 55 55 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 111 111 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 126 126 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 386 386 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 540 540 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 596 596 N-LINKED (GLCNAC...) (POTENTIAL).



SQ SEQUENCE 837 AA; 92378 MW; 5C6E9C9C8A545EB1 CRC64; 



Query Match 24.4%; Score 984; DB 1; Length 837; 

Best Local Similarity 34.0%; Pred. No. 2.5e-71; 

Matches 276; Conservative 106; Mismatches 287; Indels 142; Gaps 30; 

Qy 11 WS LLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALS F FHQKG-LQD 63 

II M I I I I I I I I : | : Ml | : 

Db 2 WGRLWPLLFSFL TVTAVPGPSLRRPS-RELDATPRLTISYEELSQIRHFKGQTQN 55 

Qy 64 FDTLLLSGDGNTLYVGAREAILAL DIQDPGVPRLKNMIPWPASDRKKSECAFKKKSN 120

: I I I I I I I I I I :: I I I : I I I I II : I : I I I : I 
Db 56 YSTLLLEEASERLLVGARGALFSLSARDIRD RTHKEIHWEASPEMQSKCHQKGKNN 111 

Qy 121 ETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDP 180

s I : I I I : I I I I I I I II I I I I I : : : : : I | | : I I I : I : | | 
Db 112 QTECFNHVRFLQRLNATHFYACGTHAFQPLCAAID-AETFILPTSFE EGKEKCPYDP 167 

Qy 181 AH KHT AVLVD GML Y S GTMNN FL G S E P I LMRT L G S Q P VL KT DN FL RWLHH DAS FV 234 

I I : : : M I I : I I I I : | : : : : | | : | | I I 

Db 168 ARGFTGLIIDGGLYTATRYEF-RSIPDIRRSRHPHSLRTEEAPMHWL-NDAEFVFSVLVR 225 

Qy 235 AAI P STQWYFFF EETAS EFD FFERLH-T S RVARVCKNDVGGEKLLQKKWTT F 286

: I : : I I I I I I : I I I : I I I I I I I I : I I : I : I I I I | | : | 

Db 226 ESKTSAVGDDDKIYFFFMEREEGSSSFTQSRSSHRVARVARVCKGDLGGKKILQKKWTSF 285

Qy 287 LKAQLLCTQPGQLPFNVIRHAVLLPADSPTAPHIYAVF — TSQWQVGGTRSSAVCAFSLL 344 

111:1:1 I : : I III:: I I I II I : I I : : I I : I : | 

Db 286 L KARL I CHIP QYETLRGVCSLNADTSSHTHFYAVFTLTTQWKT — LEASAICRYDLA 340 

Qy 345 DIERVFKGKYKELNKETSRWTTYRGPETNPRPGSC SVGPSSDK ALTFMK 393 

: I : I I I : I II II II I I I I I I : I : | | : | 

Db 341 EIQAVFTGPFMEYQDGARRWGRYEGGVPEPRPGSCITDSLRSRGYNSSQDLPSLVLDFVK 400

Qy 394 DHFLMDEQW GTPLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSG 450 

Ml II I I I I : I I II I | : : : : : | M I : I I I I I I 

Db 401 LHPLMARPWPTRGRPLLLKRNVRYTHLTGTHVSTPAGPTYDLLFLGTADGWIHKAWLG 460 

Qy 451 DSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLAR 510 

I I : : II I I : I : I : I II : : I I : : : | | : | | : | : : | | | : | | | | : I I I 
Db 461 -SGMHIIEEIQVFREPQSVDNLVISPMQHSLWGAASGVLQFPLSSCSRYQSCYDCILAR 519 

Qy 511 DPHCAWDPESRTCCL-LSAPNLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVL 569

11:111 I : : I Ihllll I II | | :| 
Db 520 DPYCGWDSSIHACMVATTVANRTELIQDIERGNR — GCEG SRDAGP PPPLKTRS 571 

Qy 570 AVPNSILELPCPHLSALASYYW-SHGPAAVPEASSTVYNGSLLLIVQDGV GGLYQCW 625 

: : Ml Ml I :| :: : | |:| | | | |; 

Db 572 VLRGDDVLLPCDQPSNLARALWLLNGSKSLSDGQDGYRVGVDGLLVTDTQLEHSGNYGCY 631 

Qy 626 ATENGFSYPVISYWVDSQDQTLALDPELAGIPREHVKVPLTRVSGGAALAAQQSYWPHFV 685 

I Ml : I I : : | | | : | I I I I I : 

Db 632 AEENGLRMLLAS YS LTVRPAT PAPAPQAPAT P GAQLA HDM 671 

Qy 68 6 TVTVL FAL VL S GAL 1 1 L VAS P L RALRARGKV 716 

: : I : : I I : : : | | I I I I I I 

Db 672 RMFYWAI AI LGGLCLI LAS S LL YVACLKGGRRGRRRKYS LGRAGRAGGSAVQLQTVS GQ 731 



Qy 717 QGCETLRPGEKAP 729 

II : I I I I I 

Db 732 CPGEEDEGDDGEGTGGLESGCLQIIPGEGAP 762 

RESULT 9 
SM4G_HUMAN



ID SM4G_HUMAN STANDARD; PRT; 838 AA. 

AC Q9NTN9; Q9HCF3; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 4G precursor. 

GN SEMA4G OR KIAA1619. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. 

RA Heath P.;

RL Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RX MEDLINE=20450683; PubMed=10997877;

RA Nagase T., Kikuno R., Nakayama M., Hirosawa M., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. 

RT XVIII. The complete sequences of 100 new cDNA clones from brain which 

RT code for large proteins in vitro."; 

RL DNA Res. 7:273-281(2000). 

CC -!- FUNCTION: MAY PLAY A ROLE IN AXON GUIDANCE (BY SIMILARITY).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AL133215; -; NOT_ANNOTATED_CDS.

DR EMBL; AB046839; BAB13445.1; ALT_INIT.

DR Genew; HGNC:10735; SEMA4G.

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR002165; Plexin_repeat.

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 



DR SMART; SM00630; Sema; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
KW Signal; Transmembrane; Immunoglobulin domain; Multigene family;
KW Neurogenesis; Developmental protein; Glycoprotein.
FT SIGNAL 1 17 POTENTIAL.
FT CHAIN 18 838 SEMAPHORIN 4G.
FT DOMAIN 18 675 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 676 696 POTENTIAL.
FT DOMAIN 697 838 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 56 489 SEMA.
FT DOMAIN 507 558 PSI.
FT DOMAIN 567 649 IG-LIKE C2-TYPE.
FT DOMAIN 565 568 POLY-PRO.
FT DOMAIN 763 774 POLY-PRO.
FT DISULFID 584 632 BY SIMILARITY.
FT CARBOHYD 55 55 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 111 111 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 126 126 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 388 388 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 542 542 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 598 598 N-LINKED (GLCNAC...) (POTENTIAL).
FT CONFLICT 543 543 R -> RSQGSR (IN REF. 2).
SQ SEQUENCE 838 AA; 91496 MW; 9B281AEE8681F245 CRC64;



Query Match 23.8%; Score 958; DB 1; Length 838; 

Best Local Similarity 34.7%; Pred. No. 3.2e-69; 

Matches 259; Conservative 101; Mismatches 255; Indels 132; Gaps 27;



Qy 57 HQKG-LQDFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAF 115

III I :: I I I I I I I I I I : : I II | I I I I : | : |

Db 48 HFKGQAQNYSTLLLEEASARLLVGARGALFSLSANDIGDGAHKE-IHWEASPEMQSKCHQ 106

Qy 116 KKKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQ 175

I I : ! : I : I I I : I I I I I I I I I I I I I I I : ::: | | | : Ml:

Db 107 KGKNNQTECFNHVRFLQRLNSTHLYACGTHAFQPLCAAID-AEAFTLPTSFE EGKEK 162

Qy 176 SPFDPAHKHTAVLVDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFV- 234

I'lll I :::|| II: I | I I : I : : : : || :|| ||

Db 163 CPYDPARGFTGLIIDGGLYTATRYEF-RSIPDIRRSRHPHSLRTEETPMHWL-NDAEFVF 220

Qy 235 AAIPSTQWYFFFEETASE FDFFERLH-TSRVARVCKNDVGGEKLL 279

: I : I I : I I I I : I I | : I I I I I I I I : I I : I : I

Db 221 SVLVRESKASAVGDDDKVYYFFTERATEEGSGSFTQSRSSHRVARVARVCKGDLGGKKIL 280

Qy 280 QKKWTTFLKAQLLCTQPGQLPFNVIRHAVLLPADSPTAPHIYAVFT--SQWQVGGTRSSA 337

I I I M : I I I I : I : I I : : | I I : : : I II II : I I : : I I

Db 281 Q KKWT S FLKARL I CHI P L YET LRGVC S LDAET S S RTH F YAAFT L S T QWKT -- LEAS A 335

Qy 338 VCAFSLLDIERVFKGKYKELNKETSRWTTYRGPETNPRPGSC SVGPSSDK 387

; l • I =1: II I I I : II I I Mill! I I : I :

Db 336 ICRYDLAEIQAVFAGPYMEYQDGSRRWGRYEGGVPEPRPGSCITDSLRSQGYNSSQDLPS 395

Qy 388 -ALTFMKDHFLMDEQW GTPLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSL 443

11:1111 II I I I I : I : I | | I : : : : : I I I I :

Db 396 LVLDFVKLHPLMARPWPTRGRPLLLKRNIRYTHLTGTPVTTPAGPTYDLLFLGTADGWI 455

Qy 444 HKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESC 503

Mill I I MHI Ml : : I II :: I :::|| M ::| ::|| I I

Db 456

Qy 504 VDCVLARDPHCAWDPESRTCCLLSA-PNLNSWKQDMERGNPEWACASGPMSRSLRPQSRP 562

MMIIMM III : I : I : ||:|||| I I II | |

Db 515 YDCILARDPYCGWDPGTHACAAATTIANRTALIQDIERGNR -- GCES SRDTGP P 566

Qy 563 QIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASSTVYNGSLLLIVQDGVGGL- 621

M : : : MM I II II

Db 567 PPLKTRSVLRGDDVLLPCDQPSNLARALW LLNGSMGL -- SDGQGGYR 611

Qy 622 YQCWATENGFS YPVI S YWVDSQDQTLALDPELAGI PREHVKV 663

I Ml III : | | : : | | | : |

Db 612 VGVDGLLVT DAQ P EH S GN YGC YAEENGLRT LLAS YS LTVRPAT P AP AP KAP AT P 665

Qy 664 P L T RVS G GAALAAQ Q S YW P H FVT VT VL FAL VL S GAL 1 1 L VAS P L 707

MM I : : | : I :::||

Db 666 GAQLAPD VRLLYVLAI AALGGLCLI LAS SLLYVACLREGRRGRRRK 711

Qy 708

I II I

Db 712



RESULT 10 
SM4F_HUMAN

ID SM4F_HUMAN STANDARD; PRT; 770 AA.

AC O95754; Q9NS35;

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Semaphorin 4F precursor (Semaphorin W) (Sema W) (Semaphorin M) (Sema

DE M).

GN SEMA4F OR SEMAW OR SEMAM.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM SMALL).

RC TISSUE=Brain; 

RX MEDLINE=99162633; PubMed=10051670; 

RA Encinas J.A., Kikuchi K., Chedotal A., de Castro F., Goodman C.S.,

RA Kimura T.;

RT "Cloning, expression, and genetic mapping of Sema W, a member of the 

RT semaphorin family."; 

RL Proc. Natl. Acad. Sci. U.S.A. 96:2491-2496(1999). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM LONG).

RA Chen F., Do A., Do T., Meisler M., Roe B.A.;

RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A. (ISOFORM LONG).

RC TISSUE=Amygdala; 

RX MEDLINE=21154917; PubMed=11230166; 

RA Wiemann S., Weil B., Wellenreuther R., Gassenhuber J., Glassl S., 

RA Ansorge W., Boecher M., Bloecker H., Bauersachs S., Blum H.,

RA Lauber J., Duesterhoeft A., Beyer A., Koehrer K., Strack N.,

RA Mewes H.-W., Ottenwaelder B., Obermaier B., Tampe J., Heubner D.,

RA Wambutt R., Korn B., Klein M., Poustka A.;

RT "Towards a catalog of human genes and proteins: sequencing and
RT analysis of 500 novel complete protein coding human cDNAs.";
RL Genome Res. 11:422-435(2001).
RN [4]

RP SEQUENCE FROM N.A. (ISOFORM SHORT).
RC TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [5] 

RP SEQUENCE OF 347-770 FROM N.A. 

RA Jang W., Spilson S.V., Hua A., Roe B., Meisler M.H.;

RT "Large-scale comparative sequence analysis of human and mouse genomic

RT DNA in the mnd2 region of mouse chromosome 6 reveals coding regions of

RT three new genes.";

RL Submitted (MAR-1998) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Has growth cone collapse activity against retinal

CC ganglion-cell axons (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=Long;

CC IsoId=O95754-1; Sequence=Displayed;

CC Name=Short;

CC IsoId=O95754-2; Sequence=VSP_006043;

CC -!- SIMILARITY: Belongs to the semaphorin family.

CC -!- SIMILARITY: Contains 1 Sema domain.

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).



CC
DR EMBL; AB022317; BAA75631.1; -.
DR EMBL; AC006544; -; NOT_ANNOTATED_CDS.
DR EMBL; AL136552; CAB66487.1;
DR EMBL; BC018361; AAH18361.1;
DR EMBL; AF053369; AAF80660.1; -.
DR Genew; HGNC:10734; SEMA4F.
DR MIM; 603706;
DR GO; GO:0005887; C:integral to plasma membrane; TAS.
DR GO; GO:0005624; C:membrane fraction; TAS.
DR GO; GO:0007411; P:axon guidance; TAS.
DR GO; GO:0007267; P:cell-cell signaling; TAS.
DR GO; GO:0007399; P:neurogenesis; TAS.
DR InterPro; IPR003659; Plexin-like.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR001627; Sema.
DR Pfam; PF01437; PSI; 1.
DR Pfam; PF01403; Sema; 1.
DR SMART; SM00423; PSI; 1.
DR SMART; SM00630; Sema; 1.
KW Signal; Transmembrane; Immunoglobulin domain; Multigene family;
KW Neurogenesis; Developmental protein; Glycoprotein;
KW Alternative splicing.
FT SIGNAL 1 34 POTENTIAL.
FT CHAIN 35 770 SEMAPHORIN 4F.
FT DOMAIN 35 659 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 660 680 POTENTIAL.
FT DOMAIN 681 770 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 65 496 SEMA.
FT DOMAIN 512 563 PSI.
FT DOMAIN 580 635 IG-LIKE C2-TYPE.
FT DISULFID 587 628 BY SIMILARITY.
FT CARBOHYD 64 64 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 133 133 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 509 509 N-LINKED (GLCNAC...) (POTENTIAL).
FT VARSPLIC 120 274 Missing (in isoform Short).
FT /FTId=VSP_006043.
FT CONFLICT 533 533 S -> N (IN REF. 1).
SQ SEQUENCE 770 AA; 83511 MW; CFBB74B41DF0E9C8 CRC64;



Query Match 22.8%; Score 919.5; DB 1; Length 770;
Best Local Similarity 32.8%; Pred. No. 3.6e-66;
Matches 250; Conservative 109; Mismatches 309; Indels 95; Gaps 24;

Qy 19 FQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQDFDTLLLSGDGNTLYV 78
Db 20 FPLLLLAVLSGPVSGRVPRSVPRTSLPISEADSCLTRFAVPHTYNYSVLLVDPASHTLYV 79

Qy 79 GAREAILALDIQDPGV-PRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVT 137
Db 80 GARDTIFALSLPFSGERPR---RIDWMVPEAHRQNCR-KKGKKEDECHNFVQILAIANAS 135

Qy 138 HLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGT 197
Db 136 HLLTCGTFAFDPKCGVIDVSRFQQV----ERLESGRGKCPFEPAQRSAAVMAGGVLYAAT 191

Qy 198 MNNFLGSEPILMRTLG-SQPVLKTDNFLRWLHHDASFVAAIPSTQV----------VYFF 246
Db 192 VKNYLGTEPIITRAVGRAEDWIRTDTLPSWLNAPA-FVAAVALSPAEWGDEDGDDEIYFF 250

Qy 247 FEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQP--GQLPFNVI 304
Db 251 FTETSRAFDSYERIKVPRVARVCAGDLGGRKTLQQRWTTFLKADLLCPGPEHGRASSVLQ 310

Qy 305 RHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRW 364
Db 311 DVAVLRPELGAGTPIFYGIFSSQWE--GATISAVCAFRPQDIRTVLNGPFRELKHDCNRG 368

Qy 365 TTYRGPET-NPRPGSC SVGPS SDKALTFMKDHFLMDEQVV GTPLLV 409
Db 369 LPVVDNDVPQPRPGECITNNMKLRHFGSSLSLPDRVLTFIRDHPLMDRPVFPADGHPLLV 428

Qy 410 KSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVEEIQLFPDPEPV 469
Db 429 TTDTAYLRVVAHRVTSLSGKEYDVLYLGTEDGHLHRAVRIG-AQLSVLEDLALFPEPQPV 487

Qy 470 RNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAP 529
Db 488 ENMKL--YHSWLLVGSRTEVTQWTTNCGRLQSCSECILAQDPVCAWSFRLDE-CVAHAG 544

Qy 530 NLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASY 589
Db 545 EHRGLVQDIESADVSSLCPK-------EPGERP-WFEVPVATAAHWLPCSPSSAWASC 596

Qy 590 YWSHGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISY---WVDSQDQT 646
Db 597 VW-HQPSGV--TALTPRRDGLEWWPGAMGAYACECQEGGAAHVVAAYSLVWGSQRD-- 651

Qy 647 LALDPELAGIPREHVKVPLTRVSGGAALAAQQSYWPHFVTVTVLFALVLSGALIILVAS- 705
Db 652 AP S RAHT VGAGLAG FFLGI LAAS LTLILIGR 682

Qy 706 -----PLRALRARGKVQGCETLRPGEKAPLSREQHLQSPKECR 743
Db 683 RQQRRRQRELLARDKVGLDLGAPPSGTTSYSQDPPSPSPEDER 725





RESULT 11 
SMZ7_BRARE 

ID SMZ7_BRARE STANDARD; PRT; 766 AA. 

AC Q9YHX4; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin Z7 precursor (Semaphorin 7) (Sema-Z7).

GN SEMAZ7 OR SEMA7.

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99060909; PubMed=9867349;
RA Halloran M.C., Severance S.M., Yee C.S., Gemza D.L., Kuwada J.Y.;
RT "Molecular cloning and expression of two novel zebrafish
RT semaphorins.";
RL Mech. Dev. 76:165-168(1998).
CC -!- SUBCELLULAR LOCATION: Type I membrane protein.
CC -!- SIMILARITY: Belongs to the semaphorin family.
CC -!- SIMILARITY: Contains 1 Sema domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
DR EMBL; AF073289; AAC72345.1;
DR ZFIN; ZDB-GENE-990715-7; sema7.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003599; Ig.
DR InterPro; IPR003659; Plexin-like.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR001627; Sema.
DR Pfam; PF00047; ig; 1.
DR Pfam; PF01437; PSI; 1.
DR Pfam; PF01403; Sema; 1.
DR SMART; SM00409; IG; 1.
DR SMART; SM00423; PSI; 1.
DR SMART; SM00630; Sema; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
KW Signal; Transmembrane; Immunoglobulin domain; Multigene family;
KW Glycoprotein.
FT SIGNAL 1 24 POTENTIAL.
FT CHAIN 25 766 SEMAPHORIN Z7.
FT DOMAIN 25 664 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 665 685 POTENTIAL.
FT DOMAIN 686 766 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 53 483 SEMA.
FT DOMAIN 501 552 PSI.
FT DOMAIN 555 640 IG-LIKE C2-TYPE.
FT DISULFID 577 623 BY SIMILARITY.
FT CARBOHYD 52 52 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 433 433 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 564 564 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 612 612 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 766 AA; 85617 MW; 0ACBC693FE7D830C CRC64;

Query Match 22.7%; Score 917; DB 1; Length 766;
Best Local Similarity 34.1%; Pred. No. 5.8e-66;
Matches 250; Conservative 126; Mismatches 258; Indels 100; Gaps 29;



QY 
Db 



12 SLLGLFLFQLLQLLLPTTTAGGGGQ — GPMPRVRYYAGDERRALSFFHQKGLQDFDTLLL 69 
HI: : I : I III : I I | I I : : | : : : | : | | 

3 SLLAVLC VLYVWSPAMLTGGLGSTLDSLPRKTVPIGSNGGRL — FREEGIWNYTTMLL 58 



QY 70 S G D GNT L YVGARE AI L AL D I Q D P GVP RL KNMI P WP AS D RKK S E C AFKKK SNETQCFNFIR 129 



I I I : I I I I I I I I I : I : I |: I : | | | | : || 

Db 59 RDDLNLLILGAREAIFALDLDD— ITIKKAMLKWEVTRDQQNDCSNKGKDATNDCKNYIR 116 

Qy 1 30 VLVSYNVTHLYTCGTFAFSPACTFIELQDSYL-LPISEDKVMEGKGQSPFDPAHKHTAVL 188 

• I I : I I I I I I : I I :: I III:: : I I : I I I I :: | : : 
Db H7 ILHKKNDGRMYVCGTKAFNPTCGYLSYADGKLTLEILQE DTKGKCPFDPFQRYTSAM 173 

Qy 189 VDGMLYSGTMNNFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFV— AAIPSTQ 241 

Ml III II I I I I : : | | : : : : : | : | | s s I : III | 
Db 174 VDGAYYSATSMNFRGSEPVMMRS — TEESIRTEFTSTWL-SEPNFIHMAHIPEGQSNPDG 230 

Qy 242 — WYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG- 297 

: I I I I I I I : : : : : I I I I I I I I I : I I :: I || I I I : I I I I : I I | 
Db 231 DDDKIYLFFSETAVEYESYTKVDVSRVARVCKGDLGGQRTLQKKWTSFLKARLDCQVPNT 2 90 

Qy 298 Q L P FN VI RH AVL L PAD S P TAP H I YAVFT S QWQ VGGT RS S AVC AF S L L D I E RVF- KGK Y K- 355 

H : : : I I I I I I I I I : : I II I : : : I I : I I | | | : | 
Db 291 NLPL-LVQDVFHLCPDDWTTCVFYAVFTP — QSDS SQYSAVCS YKI EDI KTVFS KGKFKA 34 7 

Qy 356 ELNKETS RWTTYRGPETNPRPGSC SVGPS — SDKALTFMKDHFLMDEQV 402 

INI :| II :IM |:| : | || | |:|| II: 

Db 348 PFNVETSFVKWVMYSGELPDPRPGACIDNHAREKGITKSLELPDKTLQFVKDKPLMDQAV 4 07 

^ 403 V G-TPLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQ 4 61 

I I I I I I : I I : I I I I : I I I I I : : I I : I I : I I I : : : : || I I 
Db 408 TAEQPLLVKRGAAFTRIVVTTATALNGTSHQVMFIGTKSGSVLKA-VNYNGEMVIMEEIQ 4 66 

Qy 4 62 LFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDPESR 521 

H I I :: I : I : :: :: | | | | :: : | I : : I : I I I I I I I I I I I I : : 
Db 467 LFDPSEPI KI LRLS S SKKQLYVGS EVGWQLS I S ECGRYQTCLDCVLARDPHCGWDLDTE 526 

^y 52 2 TCCLLSA PNLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQI I KEVLAVPNS I LEL 578 

I : : : : : I : | : | | | : | . ■ . . | 

Db 527 HCATINSIHRTRSSTVIQSLNDGDASQCPAIG VSKP VNISFYHGNTVKL 575 

Qy 579 PCPHLSALASYYWSHGPAAVPEASS TVYNGSLLLIVQDGVGGLYQCWATENG 630 

I III I : ::: ::| | | I I I : I 

Db 576 GCQPYSNLAQVKWQFNGEPIKPSNTIQILSDGLMIFNASL DATGYYTCSSIET- 628 

Qy 63 1 FSYPVISYWVDS QDQTLALDPELAGIPREHVKVPLTRVSG-GAALAAQQSYWPHFVT 686 

I I I :| I I : II I:| : | 

Db 62 9 VSQRKYQTQHVAYD VKMWS ES GTTAS LHHVKEKERT LVA 667 

QY 687 VTVLFALVLSGALI 700 

: I : : I I I : II 
Db 668 MWILSLVLAALLI 681 



RESULT 12 
SM4F_MOUSE

ID SM4F_MOUSE STANDARD; PRT; 777 AA.
AC Q9Z123; Q9R1Y1;
DT 30-MAY-2000 (Rel. 39, Created)
DT 30-MAY-2000 (Rel. 39, Last sequence update)
DT 10-OCT-2003 (Rel. 42, Last annotation update)
DE Semaphorin 4F precursor (Semaphorin W) (Sema W).
GN SEMA4F OR SEMAW.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=BALB/c, and 129/SvJ; TISSUE=Brain;
RX MEDLINE=99162633; PubMed=10051670;
RA Encinas J.A., Kikuchi K., Chedotal A., de Castro F., Goodman C.S.,
RA Kimura T.;
RT "Cloning, expression, and genetic mapping of Sema W, a member of the
RT semaphorin family.";
RL Proc. Natl. Acad. Sci. U.S.A. 96:2491-2496(1999).
CC -!- FUNCTION: Has growth cone collapse activity against retinal
CC     ganglion-cell axons (By similarity).
CC -!- SUBCELLULAR LOCATION: Type I membrane protein.
CC -!- SIMILARITY: Belongs to the semaphorin family.
CC -!- SIMILARITY: Contains 1 Sema domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AB021291; BAA75630.1;
DR EMBL; AB022316; BAA75634.1;
DR EMBL; AB022311; BAA75634.1; JOINED.
DR EMBL; AB022312; BAA75634.1; JOINED.
DR EMBL; AB022313; BAA75634.1; JOINED.
DR EMBL; AB022314; BAA75634.1; JOINED.
DR EMBL; AB022315; BAA75634.1; JOINED.
DR MGD; MGI:1340055; Sema4f.
DR InterPro; IPR003659; Plexin-like.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR001627; Sema.
DR Pfam; PF01437; PSI; 1.
DR Pfam; PF01403; Sema; 1.
DR SMART; SM00423; PSI; 1.
DR SMART; SM00630; Sema; 1.
KW Signal; Transmembrane; Immunoglobulin domain; Multigene family;
KW Neurogenesis; Developmental protein; Glycoprotein.
FT SIGNAL 1 40 POTENTIAL.
FT CHAIN 41 777 SEMAPHORIN 4F.
FT DOMAIN 41 667 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 668 688 POTENTIAL.
FT DOMAIN 689 777 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 71 502 SEMA.
FT DOMAIN 518 569 PSI.
FT DOMAIN 586 641 IG-LIKE C2-TYPE.
FT DISULFID 593 634 BY SIMILARITY.
FT CARBOHYD 70 70 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 139 139 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 515 515 N-LINKED (GLCNAC...) (POTENTIAL).
FT VARIANT 490 490 P -> T (IN STRAIN 129/SVJ).
FT VARIANT 659 659 S -> A (IN STRAIN 129/SVJ).
SQ SEQUENCE 777 AA; 84501 MW; B63F853355856924 CRC64;



Query Match 22.4%; Score 904; DB 1; Length 777; 

Best Local Similarity 33.0%; Pred. No. 6.6e-65; 

Matches 251; Conservative 107; Mismatches 312; Indels 90; Gaps 25; 

QY 21 LLQLLLPTTTAGGGGQGP — MPRVRYYAGDERRALSFFHQKGLQDFDTLLLSGDGNTLYV 78 

II III : I I : I : I I : | : | : : I I : : I I I I 

Db 26 LLLLLLAMLSAPVCGRVPRSVPRTSLPISEADSYLTRFAAPHTYNYSALLVDPASHTLYV 85 

79 GAREAILALDIQDPG-VPRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIRVLVSYNVT 137 
I I I : : I I I : I II II : : | | | I : I I I I : : I I : 

Db 86 GARDSIFALTLPFSGEKPR— RIDWMVPETHRQNCR-KKGKKEDECHNFIQILAIANAS 141 

Qy 138 HLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDPAHKHTAVLVDGMLYSGT 197 

N M I I I I I I I I:: : ::: |:|: ||:|| : ||: |:||: | 
Db 142 HLLTCGTFAFDPKCGVIDVSSFQQV ERLESGRGKCPFE P AQ R S AAVMAG GVL YT AT 197 

Qy 198 MNNFLGSEPI LMRTLG- SQPVLKTDNFLRWLHHDAS FVAAI PSTQWYFF 246 

: I I I I : I I I : I : I : : : : | : | | : | | | | | : : : | | 

Db 198 VKNFLGTEPIISRAVGRAEDWIRTETLSSWLNAPA-FVAAMVLSPAEWGDEDGDDEIFFF 256 

Qy 247 FEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQP — GQLPFNVI 304 

Ml: I : II : I I I I I I I : I I I I I :: I I II I I I III | | : : 
Db 257 FTETSRVLDSYERIKVPRVARVCAGDLGGRKTLQQRWTTFLKADLLCPGPEHGRASGVLQ 316 

Qy 305 RHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRW 364 

M I I : I : I I I : I I I I I I I II I | : : | | : : | 

Db 317 DMTELRPQPGAGTPLFYGIFSSQWE— GAAISAVCAFRPQDIRAVLNGPFRELKHDCNRG 374 

Qy 365 TTYRGPET-NPRPGSC SVGPS SDKALTFMKDHFLMDEQW GTPLLV 409 

I MIM II I : II I :: I I I I I I I I I I I 

Db 375 LPVMDNEVPQPRPGECITNNMKFQQFGSSLSLPDRVLTFIRDHPLMDRPVFPADGRPLLV 434 

Qy 410 KSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPV 469 

: II: M : I : I I I I I I I : I I I : : : | : : | | | : | : | | 

Db 435 TTDTAYLRWAHRVTSLSGKEYDVLYLGTEDGHLHRAVRIG-AQLSVLEDLALFPEPQPV 493 

Qy 470 RNLQLAPTQGAVFVGFSGGWRVPRANCSVYESCVDCVLARDPHCAWDPESRTCCLLSAP 529 

— I Ml I :| :|| : | ! : | : | | : 1 | [ | | | : | 

Db 494 ESMKL— YHDWLLVGSHTEVTQVNTSNCGRLQSCSECILAQDPVCAWS-FRLDACVAHAG 550 

Qy 530 NLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILELPCPHLSALASY 589 

I I : I : I I I : : : | :: | | | || | I 

Db 551 EHRGMVQDIESADVSSLCPK EPGEHPWFEVPVATVGHW-LPCSPSSAWASC 602 

Qy 590 YWSHGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISYWVDSQDQTLAL 649 

I I I : I II I :: I I I I I I I : I : I : | 

Db 603 VW- HQ P S GV — T S LT P RRD GL EVWT P GAMGAYAC E CQ E GGAARWAAY SLVW 652 

Qy 650 DPELAGI PREHVKVPLTRVSGGAALAAQQS YWPHFVT VTVLFALVLSGALI ILVAS 705 

: I I I III I ||: :| :|: 

Db 653 GSQRGPSNRAHTW GAGLVG FFLGVLAAS LTLLLI GRRQQ 692 

Qy 706 --PLRALRARGKVQGCETLRPGEKAPLSREQHLQSPKECR 743
           | | || ||       |      |::    ||:: |
Db 693 RRRQRELLARDKVGLDLGAPPSGTTSYSQDPPSPSPEDER 732



RESULT 13 
SM4F_RAT

ID SM4F_RAT STANDARD; PRT; 776 AA.
AC Q9Z143;
DT 30-MAY-2000 (Rel. 39, Created)
DT 30-MAY-2000 (Rel. 39, Last sequence update)
DT 10-OCT-2003 (Rel. 42, Last annotation update)
DE Semaphorin 4F precursor (Semaphorin W) (Sema W).
GN SEMA4F.
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
OX NCBI_TaxID=10116;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=Sprague-Dawley; TISSUE=Brain;
RX MEDLINE=99162633; PubMed=10051670;
RA Encinas J.A., Kikuchi K., Chedotal A., de Castro F., Goodman C.S.,
RA Kimura T.;
RT "Cloning, expression, and genetic mapping of Sema W, a member of the
RT semaphorin family.";
RL Proc. Natl. Acad. Sci. U.S.A. 96:2491-2496(1999).
CC -!- FUNCTION: Has growth cone collapse activity against retinal
CC     ganglion-cell axons.
CC -!- SUBCELLULAR LOCATION: Type I membrane protein.
CC -!- TISSUE SPECIFICITY: EXPRESSED AT LOW LEVELS IN THE DEVELOPING
CC     EMBRYO. EXPRESSED AT HIGH LEVELS IN THE ADULT CENTRAL NERVOUS
CC     SYSTEM AND LUNG.
CC -!- SIMILARITY: Belongs to the semaphorin family.
CC -!- SIMILARITY: Contains 1 Sema domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AB002563; BAA75629.1;
DR InterPro; IPR003659; Plexin-like.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR001627; Sema.
DR Pfam; PF01437; PSI; 1.
DR Pfam; PF01403; Sema; 1.
DR SMART; SM00423; PSI; 1.
DR SMART; SM00630; Sema; 1.
KW Signal; Transmembrane; Immunoglobulin domain; Multigene family;
KW Neurogenesis; Developmental protein; Glycoprotein.
FT SIGNAL 1 39 POTENTIAL.
FT CHAIN 40 776 SEMAPHORIN 4F.
FT DOMAIN 40 665 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 666 686 POTENTIAL.
FT DOMAIN 687 776 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 70 501 SEMA.
FT DOMAIN 517 568 PSI.
FT DOMAIN 585 640 IG-LIKE C2-TYPE.
FT DISULFID 592 633 BY SIMILARITY.
FT CARBOHYD 69 69 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 138 138 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 514 514 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 776 AA; 84298 MW; 20763182CC6C93CA CRC64;



Query Match 22.4%; Score 901.5; DB 1; Length 776;
Best Local Similarity 33.0%; Pred. No. 1.1e-64;
Matches 255; Conservative 108; Mismatches 308; Indels 101; Gaps 27;

Qy 10 PWSLLGLFLFQLLQLLLPTTTAGGGGQGP--MPRVRYYAGDERRALSFFHQKGLQDFDTL 67
Db 23 PLSLL---------LLLAILSAPVCGRVPRSVPRTSLPISEADSYLTRFAASHTYNYSAL 73

Qy 68 LLSGDGNTLYVGAREAILALDIQDPGV-PRLKNMIPWPASDRKKSECAFKKKSNETQCFN 126
Db 74 LVDPASHTLYVGARDSIFALTLPFSGERPR---RIDWMVPETHRQNCR-KKGKKEDECHN 129

Qy 127 FIRVLVSYNVTHLYTCGTFAFSPACTFIELQDSYLLPISEDKVMEGKGQSPFDPAHKHTA 186
Db 130 FIQILAIVNASHLLTCGTFAFDPKCGVIDVSSFQQV----ERLESGRGKCPFEPAQRSAA 185

Qy 187 VLVDGMLYSGTMNNFLGSEPILMRTLG-SQPVLKTDNFLRWLHHDASFVAAI-------- 237
Db 186 VMAGGVLYTATVKNFLGTEPIISRAVGRAEDWIRTETLSSWLNAPA-FVAAMVLSPAEWG 244

Qy 238 --PSTQVVYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQ 295
Db 245 DEDGDDEIFFFFTETSRVLDSYERIKVPRVARVCAGDLGGRKTLQQRWTTFLKADLLCPG 304

Qy 296 P--GQLPFNVIRHAVLLPADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGK 353
Db 305 PEHGRASGVLQAMAELRPQPGAGTPIFYGIFSSQWE--GAAISAVCAFRPQDIRAVLNGP 362

Qy 354 YKELNKETSRWTTYRGPET-NPRPGSC SVGPS SDKALTFMKDHFLMDEQ 401
Db 363 FRELKHDCNRGLPVMDNEVPQPRPGECIANNMKLQQFGSSLSLPDRVLTFIRDHPLMDRP 422

Qy 402 VV---GTPLLVKSGVEYTRLAVETAQGLDGHSHLVMYLGTTTGSLHKAVVSGDSSAHLVE 458
Db 423 VFPADGRPLLVTTDTAYLRVVAHRVTSLSGKEYDVLYLGTEDGHLHRAVRIG-AQLSVLE 481

Qy 459 EIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVYESCVDCVLARDPHCAWDP 518
Db 482 DLALFPEPQPVESMKL--YHDWLLVGSHTEVTQVNTSNCGRLQSCSECILAQDPVCAWS- 538

Qy 519 ESRTCCLLSAPNLNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVPNSILEL 578
Db 539 FRLDACVAHAGEHRGMVQDIESADVSSLCPK-------EPGEHPVVFEVPVATVGHVV-L 590

Qy 579 PCPHLSALASYYWSHGPAAVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPVISY 638
Db 591 PCSPSSAWASCVW-HQPSGV--TALTPRRDGLEVWTPGAMGAYACECQEGGAARWAAY 647

Qy 639 WVDSQDQTLALDPELAGIPREHVKVPLTRVSGGAALAAQQSYWPHFVTVTVLFAL-VLSG 697
Db 648 SLVWGSQRGPSNRAHTW GAGL VGFLLGVLAA 679

Qy 698 ALIILVAS------PLRALRARGKVQGCETLRPGEKAPLSREQHLQSPKECR 743
Db 680 SLTLLLIGRRQQRRRQRELLARDKVGLDLGAPPSGTTSYSQDPPSPSPEDER 731



RESULT 14 
SZ1A_BRARE 

ID SZ1A_BRARE STANDARD; PRT; 860 AA.

AC Q9W7J1; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin Z1A precursor (Semaphorin 1A) (Sema-Z1A).

GN SEMAZ1A OR SEMA3AA.

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE=99313409; PubMed=10386838;

RA Yee C.S., Chandrasekhar A., Halloran M.C., Shoji W., Warren J.T.,

RA Kuwada J.Y.;

RT "Molecular cloning, expression, and activity of zebrafish semaphorin

RT Z1a.";

RL Brain Res. Bull. 48:581-593(1999).

CC -!- FUNCTION: MAY INFLUENCE OUTGROWTH BY A VARIETY OF GROWTH CONES

CC INCLUDING THOSE OF THE POSTERIOR LATERAL LINE GANGLION.

CC -!- SUBCELLULAR LOCATION: Secreted (By similarity).

CC -!- DEVELOPMENTAL STAGE: EXPRESSED IN HIGHLY SPECIFIC PATTERNS WITHIN

CC THE DEVELOPING EMBRYO.

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC

DR EMBL; AF086761; AAD43964.1; -.

DR ZFIN; ZDB-GENE-991209-3; sema3aa.

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 



DR Pfam; PF00047; ig; 1.

DR Pfam; PF01403; Sema; 1.

DR SMART; SM00409; IG; 1.

DR SMART; SM00423; PSI; 1.

DR SMART; SM00630; Sema; 1.

DR PROSITE; PS50835; IG_LIKE; 1.

KW Signal; Immunoglobulin domain; Multigene family; Neurogenesis;

KW Developmental protein; Glycoprotein.

FT SIGNAL 1 17 POTENTIAL. 

FT CHAIN 18 860 SEMAPHORIN Z1A. 

FT DOMAIN 241 539 SEMA. 

FT DOMAIN 579 668 IG-LIKE C2-TYPE.

FT DOMAIN 722 858 ARG/LYS-RICH (BASIC).

FT DISULFID 652 717 BY SIMILARITY.

FT CARBOHYD 53 53 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 126 126 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 593 593 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 860 AA; 97263 MW; 5FD4C12194F5165C CRC64;

Query Match 21.8%; Score 880.5; DB 1; Length 860;

Best Local Similarity 29.8%; Pred. No. 6.1e-63;

Matches 239; Conservative 139; Mismatches 322; Indels 103; Gaps 25;

QY 13 LLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQD FDTLLL 69 

I : I : I I I : I I | : : | | : : : : : | | : : | | | : 

Db 4 LVGIFLL-LCGVALPGRVAPQHTKENVPRLKLSYNEMLESSNLVTFTGLANSSGYDTFLM 62 

QY 70 SGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQCFNFIR 129 

I : I I I I : : : I : : : | I III: I : I I : I I I I : | 

Db 63 DGERGRLLVGAEDHVFSFDLVN--INRDVKQIAWPATPSKRDECKWAGKDLRKDCSNFVR 12 0 

Qy 130 VLVSYNVTHLYTCGTFAFSPACTFIEL— QDSYLLPISEDKVMEGKGQSPFDPAHKHTA 186 

I I I I I I I : I I I I I I I | : | : | : : : : : | : | : | | : | | : : : 
Db 121 VLQS YNQTHI YI CGTGAFHPI CS FLEMGKRAEDNI FRLDANYFENGRGKS PYDPKMQS S S 180 

QY 187 VL VD GML Y S GTMNN FL G S E P I LMRT L G S Q P VL KT DN F- L RWLHH DAS FV AAIP 238 

•1:1111111 : I : I : : I I I | I : : I : III:: I : : | 

Db 181 LLLDGELYSGTSADFMGRDFAIFRTLGSHHPIRTEQHDSRWL-NEPRFLGIHLIPESDNP 239 

Qy 239 STQWYFFFEETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG- 297 

: : I I : I I : : : I I : : : I I I I : I I : I I I M I I I I : I I : I I 
Db 240 EDDKIFLFFKENAMDGEHTGKATISRIGQLCKNDMGGHRSLVNKWTTFLKAKLTCSVPGL 299 

QY 2 98 QL P FNVI RHAVLL PAD S PTAPH I YAVFT S QWQVGGT RS S AVCAFS LLD I ERVFKGK Y 354 

I : : : I : I I I I I I I I I : : I I I : I : I : I I I I I I I 
D b 300 NGIDTHFDELQDVFLMSAKDPKNPVIYAVFTTSSNI — FRGSAICMYSMADIRRVFLGPY 357 

Qy 355 KELNKETS RWTTYRGPETNPRPGS C SVGPSSDKALTFMKDHFLMDEQW-- 403 

• : I I I I I I : I I I : I I : I | I 

Db 358 AH RDGPN YQWVP FQGRVP YP RP GT C P S KT FGGFD S T KDL P DDVI T FARLH PAMYN P VQ PM 417 

QY 4 04 -GTPLLVKSGVEY — TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAWSGDSSAH 455 

I I : : I : : I I I I : I I : : II : I I : : | | | : : | | | | 

Db 418 GGKPIWRTNVEYQFTQLVVDRVEAEDG-QYDWIFIGTDLGTVLKAA/TIPRESWHDLEEV 476 



Qy 



456 LVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRANCSVY-ESCVDCVLARDPHC 514 
: : I I : : I : I I : : : I : | : : : | I : : : | I I I : : I : I I I I I I : I 



Db 


477 


VLEEMTVFREPTPITAMELSTKOOOLYLGSDLGISOMPLHRrFAA^rKAraFrrT ADnDvr 


JOO 


Qy 


515 


AWDPESRTCCLLSAPNL— NSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEVLAVP 
IN II 1 : : | | : | : | | : : | | . . , , 
AWD GTECSRYFPTAKRRTRRODIRNGDPLSOCSDLHHNnnT.FnYqqvp-FRq^/vrA/p 


R 1 O 


Db 


537 




Qy 


573 


NSILELPCPHLSALASYYWS HGPAAVPEAS STVYNGSLLLIVQDGVGGLYQC 

M : 1 I 1 1 1 1 : | | ||:: |:: | 
NSSMFLECSPKSORALIYWOLOKPNDERKHETVTDF.RT.m.Tr^nr:! t tdqt Tnan^ruT?! n 


DZ 4 


Db 


593 


DOZ 


Qy 


625 


WATENGFSYPVISYWVDSQDQTLALDPELAGIPREHVKVPLTRV-SGGAALAAQQSYW — 
1 1 : 1 1 1 : I | | : | ||: | . | 


681 


Db 


653 


699 


Qv 


682 


P H FVT VT VL FALVL S GAL 1 1 LVAS P L RAL RARGKVQ GC ET LRP G E 
l-l : 1 : : 1 1 : 1 I | 
DFMSLLEHPDLNSVDEFCERIWK REKKPKGK — KAPKVNPGTGVSIKNE 


726 


Db 


700 


746 


Qy 


727 


KAPLSREQHLQSPKECRTSASDV 749 
II: 1 1 1 : 1 : : I | 
KTPQTTAQSLQNPTQRAQNAPKV 769 




Db 


747 





RESULT 15 
SM3F_MOUSE 

ID SM3F_MOUSE STANDARD; PRT; 785 AA.

AC O88632; O88633;
DT 30-MAY-2000 (Rel. 39, Created)
DT 30-MAY-2000 (Rel. 39, Last sequence update)
DT 15-MAR-2004 (Rel. 43, Last annotation update)
DE Semaphorin 3F precursor (Semaphorin IV) (Sema IV).

GN SEMA3F.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS A AND B).

RX MEDLINE=99090943; PubMed=9875739;

RA Eckhardt F., Meyerhans A.;

RT "Cloning and expression pattern of a murine semaphorin homologous to

RT H-sema IV.";

RL NeuroReport 9:3975-3979(1998).

RN [2]

RP SEQUENCE FROM N.A. (ISOFORM A).

RC STRAIN=FVB/N; TISSUE=Colon;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Secreted (By similarity). 
CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=B;

CC IsoId=O88632-1; Sequence=Displayed;

CC Name=A;

CC IsoId=O88632-2; Sequence=VSP_006042;

CC -!- TISSUE SPECIFICITY: Expressed ubiquitously in adulthood. During 
CC embryogenesis, expressed in subregions of the central nervous 

CC system and various other tissues like skin, kidney, lung and 

CC intestine. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF080090; AAC28108.1; 

DR EMBL; AF080091; AAC28109.1; 

DR EMBL; BC010976; AAH10976.1; -. 

DR MGD; MGI: 1096347; Sema3f. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Signal; Immunoglobulin domain; Multigene family; Glycoprotein; 

KW Alternative splicing. 

FT SIGNAL 1 18 POTENTIAL. 

FT CHAIN 19 785 SEMAPHORIN 3F. 

FT DOMAIN 272 569 SEMA. 

FT DOMAIN 605 695 IG-LIKE C2-TYPE. 

FT DOMAIN 758 779 ARG/LYS-RICH (BASIC).

FT DOMAIN 715 718 POLY-PRO.

FT DISULFID 678 746 BY SIMILARITY.

FT CARBOHYD 53 53 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 126 126 N-LINKED (GLCNAC...) (POTENTIAL).

FT VARSPLIC 153 183 Missing (in isoform A).

FT /FTId=VSP_006042.

SQ SEQUENCE 785 AA; 88493 MW; 89DBA7A08D02A72E CRC64;

Query Match 20.9%; Score 841; DB 1; Length 785;

Best Local Similarity 29.6%; Pred. No. 8.3e-60;

Matches 249; Conservative 120; Mismatches 289; Indels 182; Gaps 35; 

Qy H WS — LLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQK 59 

I'll- Hill | | | | Ml | 

Db ^ WAS L LT GAW PAT P I Q DQ L PAT PRVR LSFKELKATGTAHFFN 49 

®Y 6 ° GLQDFDTLLLSGDGNTLYVGAREAI LALDIQDPGVPRLKNMIPWPASDRKKSECAFK 116 

I = M I : : I I I : : : : I : I I : I : | : I I I I : : | | 
Db 50 FLLNTTDYRILLKDEDHDRMYVGSKDYVLSLDLHD — INREPLIIHWAASPQRIEECILS 107 

Qy H7 KKSNETQCFNFIRVLVSYNVTHLYTCGTFAFSPACTFIEL 156 

I : I I I : I : : : I I I I I I I I I : : | | | : : 

Db 108 GKDGNGECGNFVRLIQPWNRTHLYVCGTGAYNPMCTYVNRGRRAQALPWTQMQWRGRGS 167 

Qy 157 QDS YLLP I S EDKVMEGKGQS PFDPAHKHTAVLVDGMLYS GTMNNFL 202 

: I : : : I : I I I : I : I I : I : : I I : I : | : 

Db 168 RATDGADRPTPTAPRQDYIFYLEPEKLESGKGKCPYDPKLDTASALINEELYAGVYIDFM 227 

Qy 203 GSEPILMRTLGSQPVLKTDNF-LRWLHHDASFVAA — IPST QWYFFFEETASEF 254 

I : : : I M I I : : I I : I I I : I I I : I I I : : I I I I I : : I 

Db 228 GTDAAI FRTLGKQTAMRTDQYNSRWL-NDPSFIHAELIPDSAERNDDKLYFFFRERSAEA 2 86 

Qy 255 DFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPG QLP FNVI RHAVLL 310 

: : : I : I : I I I I I I I I : I I I M : I : I : I I : | : : : : 

Db 287 PQNPAVY-ARIGRICLNDDGGHCCLVNKWSTFLKARLVCSVPGEDGIETHFDELQDVFVQ 345 

Qy 311 PAD S PTAPHI YAVFT SQWQVGGT RS S AVCAFS LLDI ERVFKGKYKELNKET S RWTT YRGP 370 

I I I I I I I I I I I I I I : | : | | Ml: : | : | 

Db 346 QTQDI RNP VI YAVFT S S GS V — FRGSAVCVYSMADI RMVFNGP FAHKEGPN YQWMP FS GK 403 

Qy 371 ETNPRPGSCSVG PS S DKALT FMKDHFLMDEQW GT PLLVKS GVE Y — 415 

lllhl I II I: : II: I II : I l|:|::| I 

Db 404 MPYPRPGTCPGGTFTPSMKSTKDYPDEVINFMRTHPLMYQAVYPLQRRPLWRTGAPYRL 463 

Qy 416 TRLAVETAQGLDGHSHLVMYLGTTTGSLHKAW — SGDSSAH — LVEEIQLFPDPEPVRN 471 

l-ll: II : I : : I I I I : : I : I | : : I I : : : I : I I I : 

Db 464 TTVAVDQVDAADGR-YEVLFLGTDRGTVQKVIVLPKDDQEVEELMLEEVEVFKEPAPVKT 522 

Qy 472 LQLAPTQGAVFVGFSGGWRVPRANCSVY-ESCVDCVLARDPHCAWDPESRTCCLLSAPN 530 

: • : : : : I : I I : I I : I II I I I I I : I I I I : I : I : 
Db 523 MTISSKRQQLYVASAVGVTHLSLHRCQAYGAACADCCLARDPYCAWD — GQACSRYTASS 580 

Qy 531 -LNSWKQDMERGNPEWACASGPMSRSLRPQSRPQIIKEV-LAVPNSILELPCPHLSALAS 588 

I : I I : I I I I I : :: I II I I | I : 

Db 581 KRRSRRQDVRHGNPIRQC RG FN SNAN KNAVE S VQ YGVAG S AAFL ECQ P RS P QAT 634 

Qy 589 YYW — SHGPA AVPEASSTVYNGSLLLIVQDGVGGLYQCWATENGFSYPV — ISY 638 

I I : : I I I : I I I I I I I I I I I : I : 

Db 635 VKWLFQRDPSDRRREIRAEDRFLRTEQGLLLRALQLGDRGLYSCTATENNFKHIVTRVQL 694 



Qy 639 WVDSQDQT-LALDPELAGIPREHVKVPLTRVSGG AALAAO OS 67Q 

1 :| 11 1 'I I H =1 I I II I 

Db 695 HVLGRDAVHAALFPPLA VSVPPPPGTGPPTPPYQELAQLLAQPEVGLIHQYCQG 748 

QY 680 YWPHFVTVTVLFALVLSGALI I LVAS PLRALRARGICVQGCETLRPGE KAPLSREQH 735 

Db 749 ™ R i : I I I I I I I | I I : I I 

749 YWRH VPPSPREAPGA LRPPELQDQKKPRNRRHH 781 



Search completed: May 5, 2004, 17:29:11 
Job time : 20 secs
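
Note on the summary lines: the percentages printed with each hit appear to follow directly from the counts on the same lines. The sketch below is not part of the search program's output. It assumes, based only on the figures in this listing, that Best Local Similarity is Matches divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels), and that Query Match is the hit Score divided by the Score of the query against itself (4031, taken from the 100% hit in RESULT 1). The function names are hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent identity over all alignment columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


def query_match(score, self_score):
    """Hit score as a percentage of the query self-match score (assumed definition)."""
    return 100.0 * score / self_score


# Figures copied from the RESULT 15 summary lines.
bls = best_local_similarity(matches=249, conservative=120, mismatches=289, indels=182)
# 4031 is the Score reported for the 100% self-hit in RESULT 1.
qm = query_match(score=841, self_score=4031)
print(f"Best Local Similarity {bls:.1f}%; Query Match {qm:.1f}%")
```

Under these assumptions the same two functions give the printed values for RESULT 10 through RESULT 14 as well, which is a quick consistency check on any count that was damaged in scanning.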



